(2) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:
GAAGCAGTGT GTATCTATGA TTATATCTCT GTTCATCTAT ATATTTTTGA CATGTAGCAA 
CACCTCTCCA TCTTATCAAG GAACTCAACT CGGTCTGQGT CTCCCCAGTG CCCAGTGGTG 
GCCTTTGACA GGTAGGAGGA TGCAGTGCTG CAGGCTATTT TUTITITICT TACAAAACTG 
TCTTTTCCCT TTTCCCCTCC ACCTGATTCA GCATGATCCC TGTGAGCTGG TTCTCACAAT 
CTCCTGGGAC TGGGCTGAGG CAGGGGCTTC GCTCTATTCT CCCTAACCAT ACTGTCTTCC 
TTTCCCCTTG CCACTTAGCA GTTATCCCCC CAGCTATGCC TTCTCCCTCC CTCCCTTGCC 
CTGGCATATA TTGTGCCTTA TTTATGCTGC AAATATAACA TTAAACTATC AAGTGAAAAA 
AAAAAAAAAA AAAACTCCAA GGGGGGGCCG GTACCCAATT CCCCCTATAN TGAGTCOTAT 
TACAATTCAC TGGGCCGTCG TTTTACAACG TCGTGAATGG GAAAACCTGG GCGT 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GGCACGAGTC CGGGATGAGC TCAGCCGCGG CCGACCACTG QGCGTGGTTG CTGGTGCTCA 
GCTTCGTGTT TGGATGCAAT GTTCTTAGGA TCCTCCTCCC GTCCTTCTCA TCCTTCATGT 
CCAGGGTGCT GCAGAAGGAC GCGGAGCAGG AGTCACAGAT GAGAGCGGAG ATCCAGGACA 
TGAAGCAGGA GCTCTCCACA GTCAACATGA TGGACGAGTT TGCCAGATAT GCCAGGCTGG 
AAAGAAAGAT CAACAAGATG ACGGATAAGC TCAAAACCCA TGTGAAAGCT CGGACAGCTC 
AATTAGCCAA GATAAAATGG GTGATAAGTG TCGCTTTCTA CGTATTGCAG GCTGCCCTGA 
TGATCTCACT CATTTGGAAG TATTATTCTG TCCCTGTGGC TGTCGTGCCG AGTAAATGGA 
TAACCCCTCT AGACCGCCTG GTAGCCTTTC CTACTAGAGT AGCAGGTGGT GTTGGAA1TA 
CCTGTTGGAT TTTAGTCTGT AACAAAGTTG TCGCTATTGT GCTTCATCCG TTCAGCTGAA 
CAGGAGGATG GATACAGCCG CGAGGCTAAA AAACGGATTT CCTCTTCCTA GCTTAAAATC 600

TGATTTACAC TGTTTTGTTT TTTAAGAAAC AAAAGTGCAT AGTTTAGATT TTTTTTTTTG 660 

TTGAATATGT TTGTTCTTGG ACTTTATGAG AGAGTCTTAT AAGAATCACG ATTTTCTACA 720 

CCTGTCATTG AGCCAAGAAA GTCCAGTTTA TGACACGTAT GTACTAGTGA ACACCGTCCT 780 

CGATCTGTAC GAAATGTGAA ATGTTTAGGG ACATCTCCAT GCTGTCACTT GTGATTTGCC 840 

CTCTTATGTA TTTTGGTCAT ATTGCCAACT GGAAAGTCAA AATTTTCTAA CAACTTTAAG 900 

TAAGTTCTTT GAAGACTTAG TGCTGTTTTT AATCCAGTTT AGAAAGTAAC TTAATTTTAA 960 

TACCACTACT AAAAATTCGA AAATTTCTTC TTTAATCACA TTCAATATGG TTAAAAGAAC 1020

AACACTAATT GACATTGCGT GGGCTTTTTC TCCCTTTGTT TAAAATGTCA TTTGTTGAGC 1080 

AAGAGTTGTA TAGTATTATC TACTTACTTG AGGCTGTTAA TTTTTCATTA CAGTGTTTTG 1140 

TAAATGTATC CACGAGACCA TGATGCATTG TTTTGTGCTC AACTTGTGTT TTGTATTTAA 1200 

AGCATTTTGA ATGAAGTGTA TTCTATAAGC ATTTAATATT TATGCTCTTT AGAATGGAAC 1260 

ACAGAAAACA AACCTTATAA GTCCTGATTA ATCTGAACCA ATAACCTGTG TGGCCTACAA 1320

AGTATAATTC TATTAAATGT TCCTTAAAAC AAAAAAAAAA AAAAAAAAAA AAAA 1374 



(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 596 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

GAATTCGNCA CGAGATTACT TGGACATGAA AGAACTCAGG TTCAAGTTTA TTCATTTACT 60 

AAGTTAGTTA AATCATGTGC CTTCCATGAG CCTTCATTTG GTAACTTGGA AAATGGAAAT 120 

AATAACACTA GTCATATATA TTCTACACTG CTACCATATG GACCAAAGGG ATTATAGATT 180 

ACAATCACCA TCATTCCTGC TGACAGGTAT ATAGAAAACA A1TTCATTGA AGAAAAGTCC 240 

TTACATTTAT CCTTTTCCTA ATATCTGCAT GGGTAAACTA ATAAATATAG TCATTAGAAA 300

ACCCTTATTA TTATTATTAG TTCAATGTGA GAACTGCTGC AGAAAAAATA TGCTTTATAA 360 

TATTTTCTTG AATATACATA ATATTCATAA ATTTTCAAAT CATTGAAAAT TACCTTAAAA 420

TTGGAAAAAA TGTGCATTTC TACTCATATA ACAGTATAAA ATTCCTATGT CAATCTCTTT 480 

TTTTTTTTTT TGTTTTGAGT TGGAGTCTCG CTCTGTCGCC CAGGCTGGGC AACAGAGCAG 540

GACCCTGTCT TAATTAAAAA AAAAAAAAAA AAACTCGAGG GGGGCCCGGT ACCCTA 596

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

CACATGAAGA CACACAGTGG TGAGAAGCCC TTCCGCTGCG CCCGCTGTCC TTATGCCTCT 60

CCTCATCTGG ATAACCTGAA ACGGCACCAG CGCGTCCATA CAGGAGAGAA GCCCTACAAG 120 

TGCCCCCTCT GCCCTTATGC CTGTGGCAAT CTGGCCAACC TCAAGCGTCA TGGTCGCATC 180 

CACTCTGGTG ACAAACCTTT TCGGTGTAGC CTTTGCAACT ACAGCTGCAA CCAGAGCATG 240 

AACCTCAAAC GTCACATGCT GCGGCACACA GGCGAGAAGC CTTCCGCTGT GCCACCTGCG 300 

CCTATACCAC GGGCCACTGG GACAACTACA AGCGCCACCA GAAGGTGCAT GGCCACGGTG 360

GGGCAGGAGG GCCTGGTCTC TCTGCCTCTG AGGGCTGGGC CCCACCTCAT AGCCCACCCT 420 

CTGTTTTGAG CTCTCGGGGC CCACCAGCCC TGGGGACTGC TGGCAGCCGG GCTGTCCACA 480 

CAGACTCATC CTGAACTAGG TCCTTCTTCC CCATGTTTTA TACAGACGGA CCAGAAGCCA 540 

CCTTTTTCTC CCCCGCTGGC CAGGGGCTCC ACACAGACTA ACGTAGGCAC TATAAGGACC 600 

AGCCCAACCC CATGGGCGGG GGGGCCCATA TGGACCAGGG GACCTTGCCT TGACTGAGGC 660

ACTTCACGAG CTCAGTGAGA AGGGCCCTGT ATTCACCTCC ACTGCCCCCA GGGGCTGTCG 720 

ACAAACCGGC TGGGGGACTG CCCAGCCTCC CACCTGTTTA TTTAACTTAT TTCAGTGCTT 780 

TATAATAAAG GAAACACTAA CAAAGCCATG TCTATGCTGA ATTGGCAATG GCAGGCAATT 840 

TQGCCTTACC C 851 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GTGAAATGAA AACAGTCTTT TTATAGCCTT TAGCTTGTGA GTTTGGAAGT TTGGGGGGTC 60 
TTATGTTTGT TTTGCCTCTT CTGTTTCTTG GAGGAGAGTT GAGGCTTTTC TTAGGTGCAT 120
ACACAGACCC AGGTGAACAC GCTGACTGTG AACCTGCCCT GTATCCGGAG CTGTGCTGGG 180

CACTGAGGGG ATGCAACAAA ATTAGGAGAG GWTCCTTGCT CCCAACGTCT ACTTCTCCTA 240 

CCTCAACAGG GGTCCAGGGT GCAGTGAACT CAGTTCTTGG CCCTTGGGTG AGGATTCATG 300 

GATGAATGAA AGCTAGACCT GATGGGGAGG CATTATGACT AAATAGGCCC AGCCTCCTTC 360 

CCTTCCAGCT CTGTCCTAGG AGCATAGGCG GGAAATCTGA GTAGAGTCTG ACTGCAGTTT 420

TTGCTTATGA TTTGTAAAAG CCGTCATGGG GTCAATAAGA AAATAGGGGT GATGGAGGGG 480 

GAGAAGCCCA GGACTGGGAG AATCGCACGT GCCCCAGGGG TTTTCACCAA GGATTTTCAA 540 

GACAAACTGG AGTAAGAATT AAAGCCCCAG AGGATTTAAT TATCCTGGTT TGCAAAAGAG 600 

CCTCCCATGC CAGTACCGCC CAGCCTTGGA GGCCGGAATG CTCATGGCCC CTGTGGTCTG 660 

CTTGTCCTTC AGCCCATGCC CAGCAGATAC CTCTCTGACT GGAGACGGGC TCAAAGCTGG 720

ATTAGAAAGG GGAGMGGCAC TTGTGACTTT GTTTGACTCT GTGACTCACT TCCTCGCTCA 780 

CACCTTGTTT GAACTACTGG ACTTTCAACT GGCTTTCCTT AGGTCAGGCA AGCAGACAGC 840 

TCCCCACTGA AGAGGTCTGT ACAGTGACAA CCCGGGCCGG CAGCAAGGAC ACAGATGCAG 900 

CCACAGTAAG GCTCCATCAG GACTGGGTCA GTGATGGCAA CAGGATGGCC AAGGATGGCT 960 

CTAGAACAYT CTGTCCATGC GTCACTCCCC CCAGTTTTRT TTTTAGCTTT GGCTTCAGGG 1020

AGTGACAGCC ATCACAAATA GCCACATTCT GCTCTACTCT CCAACATACC AGATTSTACA 1080 

CTGTTGTTAT TTCATGAGAC GTGAATGTTG CAGAGAGTGG GGGGATTCTG GTTGTTAAGG 1140 

AACTTACACT GGGGAGCTTT ACTCTTCCGT GTCAACAATG TGACTACATG TTCTCCAGAT 1200 

TAGCCACACA TGCAAACATC AGTGTCCTTC TAGCTTTANC CGAGAAAGAA ACCAGTCCCA 1260 

GGGAATGAAT GGTGGTCTCC CCACTCOCGG CAGCACTTTA GGCAGCOCAT AAGCTATGCG 1320

AGAATGTGAA CGCTCACCTT GCTCCGTCAC GGTTCTGACC TACCACATAA ACAGGAAGAA 1380 

GCCAGTGACC GGAACAGCTC TAGGAATAAC AAGTCAGAAT AGAAGTGTCC TTTATATTAC 1440 

CAGAAAATAT GGGCTTGGCC TAAGTCGCTG TCTCCTAACC TGCCGGGGTC ATTCCCCACC 1500 

AAACACCCCA TACTAAGGAG CCATGAGCCA CCTGGACATT CACCTTTTCT TTGACCATCT 1560 

GGAGTCTGGG GCAACTTAAG GAAGGCNCCA CACAGTGGTG CAGGCACATT TCCAAGCGTA 1620

GGTGTCCCTG GCTTTTGTGG CCAAAGCTAG TGTTATGGTC AACAACAGGC CAGGGTCTGT 1680 

GGGGCACTGA CCTTGAAAGT GGCAAAATGG AGGTTTCACA GGCTGTGCGG GAGCAGGACG 1740 

GCTTGCTTCA TCTAACAATC TCAGTTTCCT TTAAAAAAAG AAAGAAAGGA AAAGATTTCA 1800 

TAAGCAGGTG TCAGTGGACA GTTTAAGYAC TTAACCATTT CTCTTTCTTC TTATGGATGT 1860 

GAACTGTGCT GTGGATAAAT CATTTGTATT TCTTGAATGT TCTCTATGAC TAACAGTTAT 1920
TAAGTCGGTT GTGTATATGT GTAACTAATG TAACTGCCTT TTAAAATTTC ATTACAATAA 1980
AAATGACTTT GCTCTGAAMA AAAAAAAAAA AAAAACTCGA 2020 



(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

ATGAAGGGTC GTTGGTGGGA AAGATGGCGG CGACTCTGGG ACCCCTTGGG TCGTGGCAGC 60 

AGTGGCGGCG ATGTTTGTCG GCTCGGGATG GGTCCAGGAT GTTACTCCTT CTTCTTTTGT 120 

TGGGGTCTGG GCAGGGGCCA CAGCAAGTCG GGGCGGGTCA AACGTTCGAG TACTTGAAAC 180 

GGGAGCACTC GCTGTCGAAG CCCTACCAGG GTGTGGGCAC AGGCAGTTCC TCACTGTGGA 240

ATCTGATGGG CAATGCCATG GTGATGACCC AGTATATCCG CCTTACCCCA GATATGCAAA 300 

GTAAACAGGG TGCCTTGTGG AACCGGGTGC CATGTTTCCT GAGAGACTGG GAGTTGCAGG 360 

TGCACTTCAA AATCCATGGA CAAGGAAAGA AGAATCTGCA TGGGGATGGC TTGGCAATCT 420 

GGTACACAAG GAATCGGATG CAGCCAGGGC CTGTGTTTGG AAACATGGAC AAATTTGTGG 480 

GGCTGGGAGT ATTTGTAGAC ACCTACCCCA ATGAGGAGAA GCAGCAAGAG CGGGTATTCC 540

CCTACATCTC AGCCATGGTG AACAACGGCT CCCTCAGCTA TGATCATGAG CGGGATGGGC 600 

GGCCTACAGA GCTGGGAGGC TGCACAGCCA TTGTCCGCAA TCTTCATTAC GACACCTTCC 660 

TGGTGATTCG CTACGTCAAG AGGCATTTGA CGATAATGAT GGATATTGAT GGCAAGCATG 720 

AGTGGAGGGA CTGCATTGAA GTGCCCGGAG TCCGCCTGCC CCGCGGCTAC TACTTCGGCA 780 

CCTCCTCCAT CACTGGGGAT CTCTCAGATA ATCATGATGT CATTTCCTTG AAG1TGTTTG 840

AACTGACAGT GGAGAGAACC CCAGAAGAGG AAAAGCTCCA TCGAGATGTG TTCTTGCCCT 900 

CAGTGGACAA TATGAAGCTG CCTGAGATGA CAGCTCCACT GCCGCCCCTG AGTGGCCTGG 960 

CCCTCTTCCT CATCGTCTTT TTCTCCCTGG TGTTTTCTGT ATTTGCCATA GTCATTGGTA 1020 

TCATACTCTA CAACAAATGG CAGGAACAGA GCCGAAAGCG CTTCTACTGA GCCCTCCTCC 1080 

TGCCACCACT TTTGTGACTG TCACCCATGA GGTATGGAAG GAGCAGGCAC TGGCCTGAGC 1140

ATGCAGCCTG GAGAGTGTTC TTGTCTCTAG CAGCTGGTTG GGGACTATAT TCIX3TCACTC 1200 

GAGTTTTGAA TGCAGGGACC CCGCATTCCC ATGGTTGTGC ATGGGGACAT CTAACTCTGG 1260 

TCTGGGAAGC CACCCACCCC AGGGCAATGC TGCTGTGATG TGCCTTTCCC TGCAGTCCTT 1320

CCATGTGGGA GCAGAGGTGT GAAGAGAATT TACGTGGTTG TGATGCCAAA ATCACAGAAC 1380 

AGAATTTCAT AGCCCAGGCT GCCGTGTTGT TTGACTCAGA AGGCCCTTCT ACTTCAGTTT 1440

TGAATCCACA AAGAATTAAA AACTGGTAAC ACCACAGGCT TTCTGACCAT CCATTCGTTG 1500 
GGTTTTGCAT TTGACCCAAC CCTCTGCCTA CCTGAGGAGC TTTCTTTGGA AACCAGGATG 1560

GAAACTTCTT CCCTGCCTTA CCTTCCTTTC ACTCCATTCA TTGTCCTCTC TGTGTGCAAC 1620 

CTGAGCTGGG AAAGGCATTT GGATGCCTCT CTGTTGGGGC CTGGQGCTGC AGAACACACC 1680 

TGCGTTTCAC TGGCCTTCAT TAGGTGGCCC TAGGGAGATG GCTTTCTGCT TTGGATCACT 1740

GTTCCCTAGC ATGGGTCTTG GGTCTATTGG CATGTCCATG GCCTTCCCAA TCAAGTCTCT 1800 

TCAGGCCCTC AGTGAAGTTT GGCTAAAGGT TGGTGTAAAA ATCAAGAGAA GCCTGGAAGA 1860 

CATCATGGAT GCCATGGATT AGCTGTGCAA CTGACCAGCT CCAGGTTTGA TCAAACCAAA 1920 

AGCAACATTT GTCATGTGGT CTGACCATGT GGAGATGTTT CTGGACTTGC TAGAGCCTGC 1980 

TTAGCTGCAT GTTTTGTAGT TACGATTTTT GGAATCCCAC TTTGAGTGCT GAAAGTGTAA 2040

GGAAGCTTTC TTCTTACACC TTGGGCTTGG ATATTGCCCA GAGAAGAAAT TTGGCTTTTT 2100 

TTTTCTTAAT GGACAAGAGA CAGTTGCTGT TCTCATGTTC CAAGTCTGAG AGCAACAGAC 2160 

CCTCATCATC TGTGCCTGGA AGAGTTCACT GTCATTGAGC AGCACAGCCT GAGTGCTGGC 2220 

CTCTGTCAAC CCTTATTCCA CTGCCTTATT TGACAAGGGG TTACATGCTG CTCACCTTAC 2280 

TGCCCTGGGA TTAAATCAGT TACAGGCCAG AGTCTCCTTG GAGGGCCTGG AACTCTGAGT 2340

CCTCCTATGA ACCTCTGTAG CCTAAATGAA ATTCTTAAAA TCACCGATGG AACCAAAAAA 2400 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 2432 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GACGCTGGGG GCGGGTGGGG GCGCGGGGTA CCGGGCTGGA CGGCCGGCCG GCGCCCCCTC 60 

ATTAGTATGC GGACGAAGCG GCGGGCTGCG CGGAGNGACG TCCCCTGCAG CCGCGGACCG 120 
AGGCAGCGGC GGCACCTGCC QGCCGAGCAA TGCCAAGTGA GTACACCTAT GTRAAACTGA 180 



GAAGTGATTG CTCGAGGCCT TCCCTGCAAT GGTACACCCG AGCTCAAAGC AAGATGAGAA 240

GGCCCAGCTT GTTATTAAAA GACATCCTCA AATGTACATT GCTTGTGTTT GGAGTGTGGA 300

TCCTTTATAT CCTCAAGTTA AATTATACTA CTGAAGAATG TGACATGAAA AAAATGCATT 360 

ATGTGGACCC TGACCATGTA AAGAGAGCTC AGAAATATGC TCAGCAAGTC TTGCAGAAGG 420 

AATGTCGTCC CAAGTTTGCC AAGACATCAA TGGCGCTGTT ATTTGAGCAC AGGTATAGCG 480 

TGGACTTACT CCCITTTGTG CAGAAGGSCC CCAAAGACAG TGAAGCTGAG TCCAAGTACG 540

ATCCTCCTTT TGGGTTCCGG AAGTTCTCCA GTAAAGTCCA GACCCTCTTG GAACTCTTGC 600 

CAGAGCACGA CCTCCCTGAA CACTTGAAAG CCAAGACCTG TCGGCGCTGT GTGGTTATTG 660 

GAAGCGGAGG AATACTGCAC GGATTAGAAC TGGGCCACAC CCTGAACCAG TTCGATGTTG 720 

TGATAAGGTT AAACAGTGCA CCAGTTGAGG GATATTCAGA ACATGTTGGA AATAAAACTA 780 

CTATAAGGAT GACTTATCCA GAGGGCGCAC CACTGTCTGA CCTTGAATAT TATTCCAATG 840

ACTTATTTGT TGCTGTTTTA TTTAAGAGTG TTGATTTCAA CTGGCTTCAA GCAATGGTAA 900 

AAAAGGAAAC CCTGCCATTC TGGGTACGAC TCTTCTTTTG GAAGCAGGTG GCAGAAAAAA 960 

TCCCACTGCA GCCAAAACAT TTCAGGATTT TGAATCCAGT TATCATCAAA GAGACTGCCT 1020 

TTGRACATCC TTCAGTACTC AGAGCCTCAG TCAAGGTTCT GGGGGCCGAG ATAAGAACGT 1080 

CCCCACAATC GGTGTCATTG CCGTTGTCTT AGCCACACAT CTGTGCGATG AAGTCAGTTT 1140

GGCGGGTTTT GGATATGACC TCAATCAACC CAGAACACCT TTGCACTACT TCGACAGTCA 1200 

ATGCATGGCT GCTATGAACT TTCAGACCAT GCATAATGTG ACAACGGAAA CCAAGTTCCT 1260 

CTTAAAGCTG GTCAAAGAGG GAGTGGTGAA AGATCTCAGT GGAGGCATTG ATCGTGAATT 1320 

TTGAACACAG AAAACCTCAG TTGAAAATGC AACTCTAACT CTGAGAGCTG TTTTTGACAG 1380 

CCTTCTTGAT GTATTTCTCC ATCCTGCAGA TACTTTGAAG TGCAGCTCAT GTTTTTAACT 1440

TTTAATTTAA AAACACAAAA AAAATTTTAG CTdTCCCAC TTTTTTTTTC CTATTTATTT 1500 

GAGGTCAGTG TTTGTTTTTG CACACCATTT TGTAAATGAA ACTTAAGAAT TGAATTGGAA 1560 

AGACTTCTCA AAGAGAATTG TATGTAACGA TGTTGTWTTG ATTTTTAAGA AAGTAATTTA 1620 

ATTTGTAAAA CTTCTGCTCG TTTACACTGC ACATTGAATA CAGGTAACTA ATTGGAAGGA 1680 

GAGGGGAGGT CACTCTTTTG ATGGTGGCCC TGAACCTCAT TCTGGTTCCC TGCTGCGCTG 1740

CTTGGTGTGA CCCACGGAGG ATCCACTCCC AGGATGACGT GCTCCGTAGC TCTGCTGCTG 1800 

ATACTGGGTC TGCGATGCAG CGGCGTGAGG CCTGGGCTGG TTGGAGAAGG TCACAACCCT 1860 

TCTCTGTTGG TCTGCCTTCT GCTGAAAGAC TCGAGAACCA ACCAGGGAAG CTGTCCTGGA 1920 

GGTCCCTGGT CGGAGAGGGA CATAGAATCT GTGACCTCTG ACAACTGTGA AGCCACCCTG 1980 

GGCTACAGAA ACCACAGTCT TCCCAGCAAT TATTACAATT CTTGAATTCC TTGGGGATTT 2040

TTTACTGCCC TTTCAAAGCA CTTAAGTGTT AGATCTAACG TGTTCCAGTG TCTGTCTGAG 2100

GTGACTTAAA AAATCAGAAC AAAACTTCTA TTATCCAGAG TCATGGGAGA GTACACCCTT 2160 

TCCAGGAATA ATGTTTTGGG AAACACTGAA ATGAAATCTT CCCAGTATTA TAAATTGTGT 2220 

ATTTAAAAAA AAGAAACTTT TCTGAATGCC TACTGGCGGT GTATACCAGG CAGTGTGCCA 2280 

GTTTAAAAAG ATGAAAAAGA ATAAAAACTT TTGAGGAAMA AAAAAAAAAA AAAAACTCGA 2340



(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

AGTAGGGGAG ACTGAGACTG ACCGGTAGCC AGGCAGGCGG ACGACGCACG CCCGGACAGA 60

CTGAGCAGGC GCCGGAGAAC CACTCACAGG TTCCCCCCGC CTTTCCCTTT GAAANCTAGG 120 

CTTTTGCCTT TCCCGTGGCG CCCGAGAGAG AATGCTGGAC TCTGCCGACT TCAGCGCAAC 180 

TAANGATTTC TCAAGCTAGG GGACAAAOGA TCAGCCCAAT CCTGAGAAGG GGGGAACCAA 240 

GCACCCCGTC CCCATCCCCC TCCCCTCCCC CGACTAAACT CGGGCGCCAA ACCCAGCCCT 300 

TCTCTAACCA CCCTACTTCC TCCTCTCCTT TCTAGCATGG TGGCTGTATG GACAGTCTGA 360

CAGAACAGAG ACTGACATCT CCCAATCTGC CGGCCCCCCA CCTGGAACAC TACAGTGTTC 420 

TGCATTGCAC CATGACCCTG GATGTGCAAA CTGTAGTCGT TTTTGCCGTG ATTGTAGTCC 480 

TCCTGCTTGT CAATGTCATA CTCATGTTTT TCCTGGGAAC GCGCTGAATG GAGTCCAGNC 540 

ACCTGAGCTG TCGCGAACTC TCGCTTTGAT TTCATCCCGA GAGCCACCGA GAAGAAAAAA 600 

A 601



(2) INFORMATION FOR SEQ ID NO: 53:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



CTCGTGCCGA ATTCGGCACG AGAGATGGTA CTTTTAAGAG GTAATTAGGT TGCTAAGATG 60



WO 98/42738 

PCT/US98/05311 



GATTAACATC TTTCTCTTGA CACTGAGACT GGGTTCTCCT GGGAATGGTT AGTTCCCAAG 120 

AGAGTGAGTT GTTATAAAAC AATGCTGCCT CTTCTATTTT GCGCTTTTTG TTTGCACAAA 180 

CTCGGTCCCC TTCTGTTTCT CTACGATGTT TTGATGCRGC ATGAGGCAGT CATGAGAACC 240 

CACCAGATAC AGCTGCCTGA TCCTGAATTT CCCAGCCAAC AGAACCAAGT GCTAAATAAA 300 

ACTCTTTTTA ATAAGTTAAA AAAAAAAAAA AAAAAAAAAA AANAAANANA AAAAAAAAA 359



(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

GGCACGAGCT GCTGAGGCGT GAGAATGGCG TCCCGCGGCC GGCGTCCGGA GCATGGCGGA 60

CCCCCAGAGC TGTTTTATGA CGAGACAGAA GCCCGGAAAT ACGTTCGCAA CTCACGGATG 120 

ATTGATATCC AGACCAGGAT GGCTGGGCGA GCATTGGAGC TTCTTTATCT GCCAGAGAAT 180 

AAGCCCTGTT ACCTGCTGGA TATTGGCTGT GGCACTGGGC TGAGTGGAAG TTATCTGTCA 240 

GATGAAGGGC ACTATTGGGT GGGCCTGGAT ATCAGCCCTG CCATGCTGGA TGAGGCTGTG 300 

GACCGAGAGA TAGAGGGAGA CCTGCTGCTG GGGGATATGG GCCAQGGCAT CCCATTCAAG 360

CCAGGCACAT TTGATGGTTG CATCAGCATT TCTGCTGTGC AGTGGCTCTG TAATGCTAAC 420 

AAGAAGTCTG AAAACCCTGC CAAGCGCCTG TACTGCTTTT TTGCTTCTCT TTTTTCTGTT 480 

CTCGTCCGGG GATCCCGAGC TGTCCTGCAG CTGTACCCTG AGAACTCAGA GCAGTTGGAG 540 

CTGATCACAA CCCAGGCCAC AAAGGCAGGC TTCTCCGGTG GCATGGTGGT AGACTACCCT 600 

AACAGTGCCA AAGCAAAGAA ATTCTACCTC TGCTTGTTTT CTGGGCCTTC GACCTTTATA 660

CCAGAGGGGC TGAGTGAAAA TCAGGATGAA GTTGAACCCA GGGAGTCTGT GTTCACCAAT 720 

GAGAGGTTCC CATTAAGGAT GTCGAGGCGG GGAATGGTGA GGAAGAGTCG GGCATGGGTG 780 

CTGGAGAAGA AGGAGCGGCA CAGGCGCCAG GGCAGGGAAG TCAGACCTGA CACCCAGTAC 840 

ACCGGCCGCA AGCGCAAGCC CCGCTTCTAA GTCACCACGC GGTTCTGGAA AGGCACTTGC 900 

CTCTGCACTT TTCTATATTG TTCAGCTGAC AAAGTAGTAT TTTAGAAAAG TTCTAAAGTT 960

ATAAAAATGT TTTCTGCAGT AAAAAAAAAG TTCTCTGGGC CGQGCGTGGT GGCTCACACC 1020 

TGTAATCCCA GCACCTTGGG AGGCTGAGGT GGGAGGATCA TTTGAGGCCA GGAGTTTGAG 1080 

ACCTGCCTGG GCAACATAAT GAAACTTCCT TTCCAGGGAG AAAAAAAAAA AAAAAAAAAA 1140
A 1141 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

TCCTTCTCTG GGGCG6TCGC GTTGGCAGCG GATGCGGGAA GCCGGACTCT GGGCGTCATG 60 

TACTACAAGT TTAGTGGCTT CACGCAGAAG TTGGCAGGAG CATGGGCITC GGAGGCCTAT 120

AGCCCGCAGA TTNAAAGCCT GTGGTTTCCA CAGAAGCACC ACCTATCATA TTTGCCACAC 180 

CAACTAAACT GACCTCCGAT TCCACAGTGT ATGATTATGC TGGGAAAAAC AAAGTTCCAG 240 

AGCTACAAAA GTTTTTCCAG AAAGCTGATG GTGTGCCCGT CTACCTGAAA CGAGGCCTGC 300 

CTGACCAAAT GCTTTACCGG ACCACCATGG CGCTGACTGT GGGAGGGACC ATCTACTGCC 360 

TGATCGCCCT CTACATGGCT TCGCAGCCCA AAAACAAATG AGTTAGGCTG CAGAGGACTG 420

GTTTGTTTTT TGGCATAAAC CCTTTGAAGT TCCTTTTTCA TTGTTAAATT AAAATTTTTT 480 

TTTTTACTTG GATGGCTTAA CATTTTTGCA AGAAAAATAG GAAGATATGA AGATGATGTT 540 

TTGGTTTGTT TATGAAATGC ATATGGCTTG TCAGAGCTCA TTCGACAGTT AAAGCCATTG 600 

TTTAAAGAAA CGGTGCTTTG CTCTGTGTTT GTGCTCCTGA TTTCCCTGGA GGTTCTGGAT 660 

GAAGGCTGAA CACAGGCTTG TTAATGTCAG TCTGTGCTGA GGACCTCAGG GACTTGAGGT 720

TGCATTTTTG AGCATGGGGT GCAGGAGCCT TTCTGGATTT GGATGTGGCT ATGGAAAGAA 780 

CACAGAAGCC AAGGTCATGT GCATGAAATG AGGAGTTTGA GTTAGTCACC TCGGGGATTT 840 

TTTCCATTTT GCAGTAAAAT GTTAAATTAA TGTAGCCTGC CTCTATTTGT TGGGCAGGTA 900 

ATTTCAAAGG GTTATTTGCC TCATCTCCTA TCTTTAGTGA AATCTTATGT GTAATTGTGT 960 

GTATTTATTC CACCGTGGGA ACAGAGAATA CCTGTTTAGT GTTGCACTTT AGACTGGTGT 1020

CTGTTTTGTT AATGCAGCTG TGCCACAAAT TCTCCTTTAT CTTTTAAAAA TGTTATAGCT 1080 

TTAAATTTTG ATTTATTTTG ACTGTGGAAT AAATACATGA ATGAAAAATT TTAAGTTTGA 1140 

AGTTCTTTGA ATGACCTTTC AGAGTAATTT CAGAACACCA GCAGCATCTT AAACCTGAGT 1200 

CTAATTTCTT TCTTGTTAAT TAGGCACCAG ATAATCTTTA TAAAATGGTC TTAAAAGCTA 1260 

GTAATAGGAG CTTAATGGCA ATKGATGATT ACCACAKGGT TTTTTATAAA AACCTGCCTG 1320

CCCCTWAGTG AAAGGTACCT GTAACYCACA GTYCATTTAG ACACTAATTT CCTYTGCYGT 1380 

CATGATTGGK AGACTTCACT TACCCTATAT TAAOTTTGAA AAAAGGTGGA ATTTTATTAT 1440 

ATATGAAGGA ATAGTTTGTA TCTTACCATA GCACAGAACA GTGACCTCTT GCTCAGGATA 1500 

AGATGTGGTG ATTTGAAAAT ACTCATAGTA GCCTTGCAGT GATACCTCTC TCNCTCTCTC 1560 

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GGAACGCAGA GCGGAGCGTG GAGAGCGGAG CGAAGCTGGA TAACAGGGGA CCGATGATGT 60 
GGCGACCATC AGTTCTGCTG CTTCTGTTGC TACTGAGGCA CGGGGCCCAG GGGAAGCCAT 120
CCCCAGACGC AGGCCCTCAT GGCCAGGGGA GGGTGCACCA GGCGGCCCCC CTGAGCGACG 180 
CTCCCCATGA TGAOGCCCAC GGGAACTTCC AGTACGACCA TGAGGCTTTC CTGGGACGGG 240 

AAGTGGCCAA GGAATTCGAC CAACTCACCC CAGAGGAAAG CCAGGCCCGT CTGGGGCGGA 300 
TCGTGGACCG CATGGACCGC GCGGGGGACG GCGACGGCTG GGTGTCGCTG GCCGAGCTTC 360 
GCGCGTGGAT CGCGCACACG CAGCAGCGGC ACATACGGGA CTCGGTGAGC GCGGCCTGGG 420
ACACGTACGA CACQGACCGC GACGGGCGTG TGGGTTGGGA GGAGCTGCGC AACGCCACCT 480 
ATGGCCACTA CGCGCCCGGT GAAGAATTTC ATGACGTGGA GGATGCAGAG ACCTACAAAA 540 

AGATGCTGGC TCGGGACGAG CGGCGTTTCC GGGTGGCCGA CCAGGATGGG GACTCGATGG 600 
CCACTCGAGA GGAGCTGACA GCCTTCCTGC ACCCCGAGGA GTTCCCTCAC ATGCGGGACA 660 
TCGTGATTGC TGAAACCCTG GAGGACCTGG ACAGAAACAA AGATGGCTAT GTCCAGGTGG 720
AGGAGTACAT CGCGGATCTG TACTCAGCCG AGCCTGGGGA GGAGGAGCCG GCGTGGGTGC 780 
AGACGGAGAG GCAGCAGTTC CGGGACTTCC GGGATCTGAA CAAGGATGGG CACCTGGATG 840 

GGAGTGAGGT GGGCCACTGG GTGCTGCCCC CTGCCCAGGA CCAGCCCCTG GTGGAAGCCA 900 
ACCACCTGCT GCACGARAGC GACACGGACA AGGAYGGGCG GCTGAGCAAA GCGSAAATCC 960 
TGGGTAATTG GAACATGTTT GTGGGCAGTC AGGCCACCAA CTATGGYGAG GACCTGACCC 1020
GGCACCACGA TGAGCTGTGA GCMCCGNGCA CCTGCCACAG CCTCAGAGGC CCGCACAATG 1080 
ACCGGAGGAG GGGCCGCTGT GGTCTGGCCC CCTCCCTGTC CAGGCCCCGC AGGAGGCAGA 1140 

TGCAGTCCCA GGCATCCTCC TKCCCCTGGG CTCTCAGGGA CCCCCTGGGT CGGCTTCTGT 1200

CCCTGTCACA CCCCCAACCC CAGGGAGGGG CTGTCATAGT CCCAGAGGAT AAGCAATACC 1260 

TATTTCTGAC TGAGTCTCCC AGCCCAGACC CAGGGACCCT NGGCCCCAAG CTCAGCTCTA 1320 

AGAACCGCCC CAACCCCTCC AGCTCCAAAT CTGAGCCTCC ACCACATAGA CTGAAACTCC 1380 

CCTGGCCCCA GCCCTCTCCT GCCTGGCCTG GCCTGGGACA CCTCCTCTCT GCCAGGAGGC 1440 

AATAAAAGCC AGCGCCGGGA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1500 

AAAAAAN 1507 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:
GAATTCGGCA CGAGCAGTGT CCAACACTGT AGCTGGTGCC TGCCAGGTTC CCAGTGGCTG 60 
GGGTCACCAG GTCTGAAGAG AGATGTGCTG GCTGCGGGCA TGGGSCCAGA TCYTCCTGCC 120
AGTTTTCYTC TCCYTCTTTC TCATCCAATT GCTTATCAGC TTCTCAGAGA ATGGTTTTAT 180 
CCACAGCCCC AGGAACAATC AGAAACCAAG AGATGGGAAT RAAGAGGAAT GTGCTGTAAA 240 

GAAGAGTTGT CAATTGTGCA CAGAAGATAA GAAATATATG ATGAATAGAT AATTGAAAAG 300 
AGATCCTCCA GAAAGAGCAG AAGGAAGTTT CTTCAATGGC TTCCTTCAGG ATTTTAATCA 360 
TCCTTACAGC CTCTTTGAGA ATGATTGAAC TTCCAAATTC CCTGAAGTTA AAATTTTAAA 420
TTCTATTAAA CATTTTTTCG AGTAAAAAAA 450 



(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1147 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

GGCACGAGAC CCATTGAGCA GAAGGAGGCC AGGTGGGAAA GCTCCTGGGA AGAGCAGCCA 60 
GACTGGACAC TGQGCTGCTT GAGTCCTGAG TCACAATTCA GAATTCCTGG GCTCCCTGGG 120 

TGCATTCTAT CATTCCAGTT GAAAGTTTGC TTCCTTCCAG TCATGTGGCT CTTCATTCTA 180

CTCTCCTTGG CTCTCATTTC AGATGCCATG GTCATGGATG AAAAGGTCAA GAGAAGCTTT 240 

GTGCTGGACA CGGCTTCTGC CATCTGCAAC TACAATGCCC ACTACAAGAA TCACCCCAAA 300 

TACTGGTGCC GAGGCTATTT CCGTGACTAC TGCAACATCA TCGCCTTCTC CCCTAACAGC 360 

ACCAATCATG TGGCCCTGAA GGACACAGGG AACCAGCTCA 1TGTCACTAT GTCCTGCCTG 420 

AACAAAGAAG ACACGGGCTG GTACTGGTGT GGCATCCAGC GGGACTTTGC CAGGGATGAC 480 

ATGGATTTTA CAGAGCTGAT TGTAACTGAC GACAAAGGAA CCTGGCCAAT GACTTTGGTC 540 

TGGGAAAGAC TATCAGGCAC AAAACCAGAA GCTGCAAGGC TCCCAAAGTT GTCCGCAAGG 600

CTGACCGCTC CAGGACGTCC ATTCTCATCA TTTGCATACT GATCACGGGT TTGGGAATCA 660 

TCTCTGTAAT CAGTCATTTG ACCAAAAGGA GGAGAAGTCA AAGGAATAGA AGGGTAGGCA 720 

ACACTTTGAA GCCCTTCTCG CGTGTCCTGA CTCCAAAGGA AATGGCTCCT ACTGAACAGA 780 

TGTGACTGAA GATTTTTTTA ATTTAGTTCA TAAAGTGATG CTACAACAGA ATAATCACCA 840 

TGACAACTGG CCCCACACCT CAGAGACTGA TTCTGATCTC CCAGGAATTC TGAAGGTCCC 900

TCTATCCTTG ACAACAATCA TTTGCAGCCA GGTAGCAACG GCAGTAGTCA GAGGAGCTAT 960 

GATAGACCAC ACCCAAGCAA GGCTGCCCTC AAATAACATC TCAAGATCTT AGTTCTTATG 1020 

CATTCCATCA GTCAGAAGTG AAGAAGAGGT GGAGAATCTG GATTGGGGAC CAGGAAATCA 1080 

CTTGTAOTTT GTTAGCCAAT AAATTCCTAG CCAGTGTTGA ATGAAAAAAA AAAAAAAAAA 1140 

AAAAAAA 1147



(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GGCAGAGGCT CCTCAGAAGG GCGTGGGCTC TCCAGTCTTC CACAGTCCCC ACCATGCCCT 60

GTTGCCTTAC CGCTGACGTA GCTCACCCAT CTTTTACTTG CCTGGCTAAG ATGCATGGCA 120 

TYWCATTTCC TCCTTGTTGC ACTGCAGTCA GTCCCTCACT GCCCCCATCT CCTGGAAGAG 180 

GAGCATAAGC TTTGCAAGGT CAGCCACTTC TCTGGGGTCA CACTAGTTAC ATCAAGACAG 240 

GACTCCAGCT CATATGTGCC AGTGCAGACA CTCTTCATCC ACCTGGGGCC CTGGGCTTGG 300 

GACCTGGYTC CTTGCACAGC AGARGACCCG GAGGCTGAGA GGAGCTTGCG GTTGTGTCAT 360

AGTCACCTGG CCAGARGGAA CGTGAGCCCC TCCCAAGCTG CAGARGGARG GARCARGCGT 420

GGCTGTCAGC ACCGAGGTAG CAGAGAATTA ACATTCTTGT CAGCAGAGAA TGAAGCAGGA 480 

ATATAATTAA AACTTTGCCC TTGGAATAGC TGATTCATTT GAATTTTATT CCACACGTTT 540 

GAAAGAGGAA AGAAAATGTG AAGACTTGCA GCCTGGTTCT CGCCTGGCCT GGGCTGGCCC 600 

AGCTGTCAGG CCCGGTTCCT TTCTGAGCAT TCAGTCCACT GATGTTGACT GAGGGCCAGG 660

AGAGACCCTC AGCAGGGTAT TACCATATCA GCCTCCTATC GCTGCTGGGA GAAATTACCA 720 

TGAATTCAGT GGCTTAAAAC AACACACGAG CCTCTCTGAG CCTACCCTGG CTCAGGA 777 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



AAGANTGATT TTCCTTACTC TCCAAAGCGT CAGCATTTTG AAGTTTCTTT TATGAAAGTG 60 

GGGGCAAGAA TCAGGGTGAA AATGAGTGTA AACAAAGCCC ATCCTGTGGT CAGCACCCAC 120 



TGGAGGTGGC CAGCAGAGTG GCCTCAGATG TTCCTGCACC TGGCCCAGGA GCCCAGGACA 180 
GAGGTCAAAT CTAGGCCCCT TGGTCTGGCT GGATTCATCA GGCAAGATTC GAAAACAAGA 240



AAACCTCTAG AACAAGAAAC AATCATGTCT GCAGCAGATA CGGCACTGTG GCCCTATGGC 300 



CATGGCAATC GTGAGCACCA AGAGAATGAG TTACAGAAAT ATCTCCAATA CAAAGACATG 360 

CATCTCCTGG ACAGTGGACA GTCGCTGGGA CACACACACA CACTTCAAGG CTCACACAAC 420 



CTAACAGCCT TAAATATCTG AAGAAACAGA ATCACGACAT TAAGTCAGCA GAGGGAGAGG 480 
TAGGCTGAAG CAGCAGGAGG CCAATTTTAT ATCCCACAGA TTTTTTTAAA AATGACTCCC 540



CAGCAAGGGG TGGGGAGAAA GCCACTGATT TAGGAGAGTT CTTGGCTCAG CCAACCACTG 600 



CGGTTATCTA CACGTTTTAC AAAGGCACRG AAGTAGAGAG GGGCTGCACT CACGACCdC 660 

OCCAGGGCCC GCACAGCCAG ACACGGTGGG TTCTTCCTTT TTCCCTTCTG GCCTTGGTGG 720 



AATTCCTACC ACGGTGGCCT CTGCCTTTGG GACAATGCCT TCATGCTCAT CCCCGGGTCA 780 
AGGATGGAGT CTGTTACCAT TTTCCAGGGG AAATTCCAAG GACCAGCCCC GCCTCATTAC 840



GTTCACCCCA CAGGAAGGTG ATCTGGAAAG CCTGTAAACA CGTACTCTGG GTGGCTGAGT 900 



GGTGTCACCA AGCTGCTTTT GTGCAGGGCT GAAGCACAGA CAAGAGGGCA GGCAGCTGCC 960 

GGAGGCCTGA AGTGGGGAGA GATCCCCGCA GGCCTGCAGG AGCCAGGGAG AACCTCCAAC 1020 
TGGATCTAAA CTGTGGGACA GCCCAGGCGT GCCCXTTCTTC ACATGGCTCC CAGGCTCCCT 1080 
CAAAGCCCTT CCCAGGCCCT GCAGGAAGAG AGGGAGGGTG AGGAGAGGCA GGGAGGGCAG 1140
AGGTCGCCTG AAAGCCTGGG CTCCGAACTC CCTCAGCAGA GCTTTAAAGT G 1191 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1580 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:

CCCCGCCCCC CGCCCACGAA GGAAGTGGCT GCTGCTCCGG CGCGGACCCA GAGCCGGTTC 60 

GGCGCGTCGA CTGCCCAGAG TCCGCGGCCG GGCGCGGGAG GAGCCAAGCC GCCATGGCCT 120 

ACCACAGCTT CCTGGTGGAG CCCATCAGCT GCCACGCCTG GAACAAGGAC CGCACCCAGA 180 

TTGCCATCTG CCCCAACAAC CATGAGGTGC ATATCTATGA AAAGAGCGGT GCCAAATGGA 240 

CCAAGGTGCA CGAGCTCAAG GAGCACAACG GGCAGGTGAC AGGCATCGAC TGGGCCCCCG 300

AGAGTAACCG TATTGTGACC TGCGGCACAG ACCGCAACGC CTACGTGTGG ACGCTGAAGG 360 

GCCGCACATG GAAGCCCACG CTGGTCATCC TGCGGATCAA CCGGGCTGCC CGCTGCGTGC 420 

GCTGGGCCCC CAACGAGAAC AAGTTTGCTG TGGGCAGCGG CTCTCGTGTG ATCTCCATCT 480 

GTTATTTCGA GCAGGAGAAT GACTGGTGGG TTTGCAAGCA CATCAAGAAG CCCATCCGCT 540 

CCACCGTCCT CAGCCTGGAC TGGCACCCCA ACAATGTGCT GCTGGCTGCC GGCTCCTGTG 600

ACTTCAAGTG TCGGATCTTT TCAGCCTACA TCAAGGAGGT GGAGGAACGG CCGGCACCCA 660 

CCCCGTGGGG CTCCAAGATG CCCTTTGGGG AACTGATGTT CGAATCCAGC AGTAGCTGCG 720 

GCTGGGTACA TGGCGTCTGT TTCTCAGCCA GCGGGAGCCG CGTGGCCTGG GTAAGCCACG 780 

ACAGCACCGT CTGCCTGGCT GATGCCGACA AGAAGATGGC CGTCGCGACT CTGGCCTCTG 840 

AAACACTACC ACTGCTGGCG CTGACCTTCA TCACAGACAA CAGCCTGGTG GCAGCGGGCC 900

ACGACTGCTT CCCGGTGCTG TTCACCTATG ACGCCGCCGC GGGGATGCTG AGCTTCGGCG 960 

GGCGGCTGGA CGTTCCTAAG CAGAGCTCGC AGCGTGGCTT GACGGCCCGC GAGCGCTTCC 1020 

AGAACCTGGA CAAGAAGGCG AGCTCCGAGG GTGGCACGGC TGCGGGCGCG GGCCTAGACT 1080 

CGCTGCACAA GAACAGCGTC AGCCAGATCT CGGTGCTCAG CGGCGGCAAG GCCAAGTGCT 1140 

CGCAGTTCTG CACCACTGGC ATGGATGGCG GCATGAGTAT CTGGGATGTG AAGAGCTTGG 1200

AGTCAGCCTT GAAGGACCTC AAGATCAAAT GACCTGTGAG GAATATGTTG CCTTCATCCT 1260

AGCTGCTGGG GAAGCGGGGA GAGGGGTCAG GGAGGCTAAT QGTTGCTTTG CTGAATGTTT 1320 

CTGGGGTACC AATACGAGTT CCCATAQGGG CTGCTCCCTC AAAAAGGGAG GGGACAGATG 1380 

GGGAGCTTTT CTTACCTATT CAAGGAATAC GTGCCTTTTT CTTAAATGCT TTCATTTATT 1440 

GAAAAAAAAA AAAAATGCCC CCAAAGCACT ATGCTGGTCA TGAACTGCTT CAAAATGTGG 1500

AGGTAATAAA ATGCAACTGT. GTAAAAAAAA AAAAAAAAAA AAATGACCCT CGCGATCTAG 1560 

AACTAGNCGG ACGCNTGGGT 1580 

(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

GGCACGAGGC GCGATGCAGC ACAGGCTAGA GGCTGCGCAA SGCGGGGGCC CGCCCCTGGG 60 

ACCCTCCGGG CCGGGCGGTT TGGCCCCTTA GCGCCCGGGC GTCGGGGCGG TAAAAGGCCG 120 

GCAGAAGGGA GGCACTTGAG AAATGTCTTT CCTCCAGGAC CCAAGTTTCT TCACCATGGG 180 

GATGTGGTCC ATTGGTGCAG GAGCCCTGGG GGCTGCTGCC TTGGCATTGC TGCTTGCCAA 240

CACAGACGTG TTTCTGTCCA AGCCCCAGAA AGCGGCCCTG GAGTACCTGG AGGATATAGA 300 

CCTGAAAACA CTGGAGAAGG AACCAAGGAC TTTCAAAGCA AAGGAGCTAT GGGAAAAAAA 360 

TGGAGCTGTG ATTATGGCCG TGCGGAGGCC AGGCTGTTTC CTCTGTCGAG AGGAAGCTGC 420 

GGATCTGTCC TCCCTGAAAA GCATGTTGGA CCAGCTGGGC GTCCCCCTCT ATGCAGTGGT 480 

AAAGGAGCAC ATCAGGACTG AAGTGAAGGA TTTCCAGCCT TATTTCAAAG GAGAAATCTT 540

CCTGGATGAA AAGAAAAAGT TCTATGGTCC ACAAAGGCGG AAGATGATGT TTATGGGATT 600 

TATCCGTCTG GGAGTGTGGT ACAACTTCTT CCGAGCCTGG AACGGAGGCT TCTCTGGAAA 660 

CCTGGAAGGA GAAGGCTTCA TCCITGGGGG AGTTTTCGTG GTGGGATCAG GAAAGCAGGG 720 

CATTCTTCTT GAGCACCGAG AAAAAGAATT TGGAGACAAA GTAAACCTAC TTTCTGTTCT 780 

GGAAGCTGCT AAGATGATCA AACCACAGAC TTTGGCCTCA GAGAAAAAAT GA1TGTGTGA 840

AACTGCCCAG CTCAGGGATA ACCAGGGACA TTCACCTGTG TTCATGGGAT GTATTGTTTC 900 

CACTCGTGTC CCTAAGGAGT GAGAAACCCA TTTATACTCT ACTCTCAGTA TGGATTATTA 960 

ATGTATTTTA ATATTCTGTT TAGGCCCACT AAGGCAAAAT AGCCCCAAAA CAAGACTGAC 1020

AAAAATCTGA AAAACTAATG AGGATTATTA AGCTAAAACC TGGGAAATAG GAQGCTTWAA 1080

ATGACTGCCM GCTGGTGCRT GCTCACACTT GGCCCAC 1117

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

CCCACGCGTG CKGGCGCCTG GCAGCCACCG CCTGGGAGGT TACTGTAAGG CCCGCAGCTC 60 

CCGCCAGCTC CCGCGGACTS CTGCCGCCTC CTTACCATGA AGCCAGTAAG TCGTCGCACG 120 

CTGGACTGGA TTTATTCAGT GTTGCTGCTT GCCATCGTTT TAATCTCCTG GGGCTGCATC 180 

ATCTATGCTT CGATGGTGTC TGCAAGACGA CAGCTAAGGA AGAAATACCC AGACAAAATC 240 

TTTGGGACGA ATGAAAATTT GTAACTCTTC TGGATTTAAT TATCTGAAAA TACAGTTCTT 300 

TCCCTCATGC TTATGTAGAT ATAAAAATAA AATTCATAAT GCAAAAAAAA AAAAAAAAAA 360 

G 361 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GGCACGAGGT CTGCCAAGCT ATAGACCATG GCTGTGAACA CATTTGTGTG AACAGTGACG 60 

ACTCATACAC GTGCGAGTGC TTGGAGGGAT TCCGGCTCGC TGAGGATGGG AAACGCTGCC 120 

GAAGAAGGAT GTCTGCAAAT CAACCCACCA TGGCTGCGAA CACATTTGTG TTAATAATGG 180 

GAATTCCTAC ATCTGCAAAT GCTCAKAGGG ATTTGTTCTA GCTGAGGACG GAAGACGGTG 240 

CAAGAAATGC ACTGAAGGCC CAATTGACCT GGTCTTTGTG ATCGATGGAT CCAAGAGTCT 300 

TGGAGAAGAG AATTTTGAGG TCGTGAAGCA GTTTGTCACT GGAATTATAG ATTCCTTGAC 360 

AATTTCCCCC AAAGCCGCTC GAGTGGGGCT GCTCCAGTAT TCCACACAGG TCCACACAGA 420

GTTCACTCTG AGAAACTTCA ACTCAGCCAA AGACATGAAA AAAGCCGTGG CCCACATGAA 480

ATACATGGGA AAGGGCTCTA TGACTGQGCT GGCCCTGAAA CACATGTTTG AGAGAAGTTT 540 

TACCCAAGGA GAAGQGGCCA GGCCCTTTCC ACAAGGGTGC CCAGAGCAGC CATTGTGTTC 600 

ACCGACGGAC GGGCTCAGGA TGACGTCTCC GAGTGGGCCA GTAAAGCCAA GGCCAATGGT 660 

ATCACTATGT ATGCTGTTGG GGTAGGAAAA GCCATTGAGG AGGAACTACA AGAGATTGCC 720 

TCTGAGCCCA CAAACAAGCA TCTCTTCTAT GCCGAAGACT TCAGCACAAT GGATGAGATA 780 

AGTGAAAAAC TCAAGAAAGG CATCTGTGAA GCTCTAGAAG ACTCCGATGG AAGACAGGAC 840 

TCTCCAGCAG GGGAACTGCC AAAAACGGTC CAACAGCCAA CAGTGCAACA CAGATATCTG 900

TTTGAAGAAG ACAATCTTTT ACGGTCTACA CAAAAGCTTT CCCATTCAAC AAAACCTTCA 960 

GGAAGCCCTT TGGAAGAAAA ACACGATCAA TGCAAATGTG AAAACCTTAT AATGTTCCAG 1020 

AACCTTGCAA ACGAAGAAGT AAGAAAATTA ACACAGCGCT TAGAAGAAAT GACACAGAGA 1080 

ATGGAAGCCC TGGAAAATCG CCTGAGATAC AGATGAAGAT TAGAAATCGC GACACATTTG 1140 

TAGTCATTGT ATCACGGATT ACAATGAACG CAGTGCAGAG CCCCAAAGCT CAGGCTATTG 1200

TTAAATCAAT AATGTTGTGA AGTAAAACAA TCAGTACTGA GAAACCTGGT TTGCCACAGA 1260 

ACAAAGACAA GAAGTATACA CTAACTTGTA TAAATTTATC TAGGAAAAAA ATCCTTCAGA 1320 

ATTCTAAGAT GAATTTACCA GGTGAGAATG AATAAGCTAT GCAAGGTATT TTGTAATATA 1380 

CTGTGGACAC AACTTGCTTC TGCCTCATCC TGCCTTAGTG TGCAATCTCA TTTGACTATA 1440 

CGATAAAGTT TGCACAGTCT TACTTCTGTA GAACACTGGC CATAGGAAAT GCTGTTTTTT 1500

TGTAYTGGAC TTTACCTTGA TATATGTATA TGGATGTATG CATAAAATCA TAGGACATAT 1560 

GTACTTGTGG AACAAGTTGG ATTTTTTATA CAATATTAAA ATTCACCACT TCAGAGRAAA 1620 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAANAAAA 1668 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

GGGTCGACCC ACGCGTCCGC CCACGCGTCC GGATGGCTGC GCTGTTGCTG AGACACGTTG 60 
GTCGTCATTG CCTCCGAGCC CACTTTAGCC CTCAGCTCTG TATCAGAAAT GCTGTTCCTT 120 
TGGGAACCAC GGCCAAAGAA GAGATGGAGC GGTTCTGGAA TAAGAATATA GGTTCAAACC 180

GTCCTCTGTC TCCCCACATT ACTATCTACA GTTGGTCTCT TCCCATGGCG ATGTCCATCT 240

GCCACCGTGG CACTGGTATT GCTTTGAGTG CAGGGGTCTC TCTTTTTGGC ATGTCGGCCC 300 

TGTTACTCCC TGGGAACTTT GAGTCTTATT TGGAACTTGT GAAGTCCCTG TGTCTGGGGC 360 

CAGCACTGAT CCACACAGCT AAGTTTGCAC TTGTCTTCCC TCTCATGTAT CATACCTGGA 420 

ATGGGATCCG ACACTTGATG TGGGACCTAG GAAAAGGCCT GAAGATTCCC CAGCTATACC 480

AGTCTGGAGT GGTTGTCCTG GTTCTTACTG TGTTGTCCTC TATGGGGCTG GCAGCCATGT 540 

GAAGAAAGGA GGCTCCCAGC ATCATCTTCC TACACATTAT TACATTCACC CATCTTTCTG 600 

TTTGTCATTC TTATCTCCAG CCTGGGAAAA GTTCTCCTTA TTTGTTTAGA TCCTTTTGTA 660 

TTTTCAGATC TCCTTGGAGC AGTAGAGTAC CTGGTAGACC ATAATAGTGG AAAAGGGTCT 720 

AGTTTTCCCC TTGTTTCTAA AGATGAGGTG GCTGCAAAAA CTCCCCTTTT TTGCCCACAG 780

CTTGCCTACT CTCGGCCTAG AAGCAGTTAT TCTCTCTCCA TATTGGGCTT TGATTTGTGC 840 

TGAGGGTCAG CTTTTGGCTC CTTCTTCCTG AGACAGTGGA AACAATGCCA GCTCTGTGGC 900 

TTCTGCCCTG GGGATGGGCC GGGTTGGGGG GTGGGTTGGT GAGGCTTTGG GTGCCACTGC 960 

CTGTGGGTTG CTGGCTTAAA GGACAATTCT CTTCATTGGT GAGAGCCCAG GCCATTAACA 1020 

CCTACACAGT GTTATTGAAA GAAGAGAGGT GGGGGTGGAG GGGAATTAGT CTGTCCCAGC 1080

TAGAGGGAGA TAAAGAGGGC TAGTTAGTTC TTGGAGCAGC TGCTTTTGAG GAGAAAATAT 1140 

ATAGCTTTGG ACACGAGGAA GATCTAGAAA ATTATCATTG AACATATTAA TGGTTATTTC 1200 

TTTTTC1TGG ATTTCCAGAA AAGCCTCTTA ATTTTATGCT TTCTCATCGA AGTAATGTAC 1260 

CCTTTTTTTC TGAAACTGAA TTAAATACTC ATTTTATCTT TGAAAAAAAA AAAAAAAACC 1320 

TNGGGGGGGG CCCCGGACCC NAATTGGCCC TAT 1353



(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CGGAAGAAAG CAGCCATCCA GACATTTCAG AACACGTACC AGGTGTTAGC TGTGACCTTC 60
AATGACACAA GTGATCAGAT TATTTCTGGT GGAATAGACA ATGATATCAA GGTCTGGGAC 120 



TGCGCCAGAA CAAGCTAACC TACACCATGA GAGGCCATGC AGATTCAGTG ACTGGCCTGA 180

GTTTAAGTTC TGAAGGCTCT TATCTTTTGT CCAATGCAAT GGACAATACA GTTCGTGTCT 240

GGGATGTCCG GCCATTTGCC CCCAAAGAGA GATGTGTAAA GATATTTCAA GGAAATGTGC 300 

ACAACTTTGA AAAGAACCTT CTGAGATGTT CTTGGTCACC TGATGGAAGC AAAATAGCAG 360 

CTGGCTCAGC CGACAGGTTT GTTTATGTGT GGGATACCAC AAGCAGGAGA ATATTGTATA 420 

AGCTGCCCGG CCATGCTGGC TCCATCAATG AAGTGGCTTT CCACCCTGAT GAGCCCATCA 480 

TTATCTCAGC ATCGAGTGAC AAGAGACTGT ATATGGGAGA GATTCAGTGA AGATATGGAC 540 

TGGAAGACTC CAAQGCCGCT TGTCTTTGAG ACCTCAGACT GCATAAGTGA TGCCAAATGT 600 

TGGATGTCCA GGYTAGCACC CTCCCTTCAG ATGACCATTG CTAGCAAGAA ACAGGAGGCG 660

GTGGCCATAT TCCAAAAACC ACTTCTGTCC CATTTCACCA GGATGACTAA GGCAAGCTCC 720 

CTGTGGCCTC TAAAAACCAC CTGCCAGATT TCAGGGACTG TTTTTTTTTT TCTTTTTCTT 780 

TTTTCCTGTT TTCTAATGCA GGCCCAATGT GACAAATTTG TTGGTTGGGA TTTTTTTTTT 840 

TTTTTGTAAC TGGCTTGTAT GATATTTTCT TTCTGTATTT CTCTATATCA TTTTGTATTA 900 

AAAGCCAAAT AGATGCCTTT TTACAAGARM AAAAAAAAAA AAAAAAAAAA NNAAAAAAAA 960

CTGGGAGGGG GGGCCCGGTA CCCAAATCGC CGGATATGAT CGTAAACAAT C 1011 



(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1193 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

GGCCGGGCGG TGCGCACTGC GGGCGCATCC CTGCCCCGGC GCCCTCCGTG CCCGCGGGAC 60 

CTGACAGCCG GGTCAGAGGG CGAACTGTGC TCAGGCCCGG GCTGGACGCA GAGCCAGAGC 120 

TGTCCCCAGA GGAGCAGAGG GTCCTGGAAA GGAAGCTGAA AAAGGAACGG AAGAAAGAGG 180 

AGAGGCAGCG TCTGCGGGAG GCAGGCCTTG TGGCCCAGCA CCCGCCTGCC AGGCGCTCGG 240 

GGGCCGAACT GGCCTGGGAC TACCTCTGCA GATGGGCCCA AAAGCACAAG AACTGGAGGT 300

TTCAGAAGAC GAGGCAGACG TGGCTCCTGC TGCACATGTA TGACAGTGAC AAGGTTCCCG 360 

ATGAGCACTT CTCCACCCTG CTGGCCTACC TGGAGGGGCT GCAGGGCCGG GCCCGAGAGC 420 

TGACGGTGCA GAAGGCGGAA GCCTGATGCG GGAGCTGGAT GAGGAGGGCT CTGATCCCCC 480 

CCTGCCGQGG AGGGCCCAGC GCATCCGACA GNTGCTGCAG CTGCTCTCCT AGTGGGTTCA 540 

GCGCGGGGCG GGGCCGCTGC CCAGTGCAGG GCTGCCTCAG ACCACACAGG GTGCAGCTCC 600

TCCGGCGGTG GGGGCCGGGT TCACCAGCAG GGCAGCGGCT GAGCAAGGGC TTTCAGCTCC 660

TCCGGTGGTG GGGGCCGGGA TCACCAGCAG CAGAGCCTCG CAAGGGCCCC TTCCCTCCTC 720 

CAGACCCTCC TTGGCCGGTG ACGCTGTGAC AGTGATGGCA QGTTCAGTGC CTTCAGCGCA 780 

GAGCGTGGAT GCTCTGGAAT CACCCGGACC CCTGGCCTTG GAGGGACCCT CCAGCCCCAG 840 

GAATCTGCTT TGGAGGGAAA TGTCTATTTT TCTACCGGGA ATATTTTAGA GATTGGGGCA 900

TGCTGGCTCC TCCCGCCAGC TGCAAACCTG CACCTTCCGC CTGATTCCCG ATCCCCCTGC 960 

GTGGGCCGCA TTCCTGGTCC CCTGCCTGCG TCCATCGAGG GGCCTGGCTG TGGCCTGTTT 1020 

TCCTTTGACC CCACACAGCG TCATTGCGGG TCATGGGGAG CCCCTGGTGG GAGCTTGTGG 1080 

AGTCGGATCA CGTACCTGTG CAGAAACGGC CTCTGTGGCT GCATTTGAAA TAAAACCCGA 1140 

CCCAGCAGCA AAAAAAAAAA AAAAAANCNC NAGGGGGGGC CCGGNACCCA ATT 1193



(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

GAATTCGGCA CGAGTTGGCA CATGATGCAA AATGCATTTC TCAGAGTAGA TTGCAGTCAA 60

AAATGTTGGA AACTACTAAG CATGTGCARA TAGCATGCAT GCTGCTGCTG ACCTGCCAGA 120 

TATTTCTCCC TTCCTCCCTT TCTCCCTCAT TTATTCATTC ATTAACTGAT TCATTCATCC 180 

CATTAAAAAA ATTATATGTA TGTTTTGTGC AAAGCACCCT ACTCAAGGCT GCGGGGTACA 240 

AAAGTATATC AGAAGCCTTG GGCTTTGACM WACTTCTCTG TAGTAGTGCT AGATTTGTGT 300 

GGATCTGCCA CACTTACTCC AGGCCTCTTG TGACCTGTGC TTTGCATTAA TCTCTTAGGC 360

TAAGCCACAT ACCTTTTCAT TATACAATCT TTGCTGATGC TAAGGACAGA TTCCAAAGTG 420 

CCCTCCTTAT AATTTTTGTA TTTAATGCAA AGTGTAATCA AGAATAGGCC ATTCTTAGGT 480 

CAATTGCTTT TCTGTATTTA TCTTTTCAAA CAATAAATAA TCAGTGGGAT GAAAAAGGGC 540 

CGGAAAAAAA AAAAAAAAAA 560 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1657 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



CGGACNGAGC CGCCGCCGGG CACTTCCTGT GGAQGCCGCA GCGGGTGCGG GCGCCGACGG 60 
GCGAGAGCCA GCGAGCGAGC GAGCGAGCCG AGCCGAGCCT CCCGCCGTCG CCATGGGCCA 120



GAACGACCTG ATGGGCACGG CCGAGGACTT CGCCGACCAG TTCCTCCGTG TCACAAAGCA 180 



GTACCTGCCC CACGTGGCGC GCCTCTGTCT GATCAGCACC TTCCTGGAGG ACGGCATCCG 240 

TATGTGGTTC CAGTGGAGCG AGCAGCGCGA CTACATCGAC ACCACCTGGA ACTGCGGCTA 300 



CCTGCTGGCC TCGTCCTTCG TCTTCCTCAA CTTGCTGGGA CANTGACTGG CTGCGTCCTG 360 
GTGTTGAGCA GGAACTTCGT GCAGTACGCC TGCTTCGGGC TCTTTGGAAT CATAGCTCTG 420



CAGACGATTG CCTACAGCAT TTTATGGGAC TTGAAGTTTT TGATGAGGAA CCTGGCCCTG 480 



GGAGGAGGCC TGTTGCTGCT CCTAGCAGAA TCCCGTTCTG AAGGGAAGAG CATGTTTGCG 540 

GGCGTCCCCA CCATGCGTGA GAGCTCCCCC AAACAGTACA TGCAGCTCGG AGGCAGGGTC 600 



TTGCTGGTTC TGATGTTCAT GACCCTCCTT CACTTTGACG CCAGCTTCTT TTCTATTGTC 660 
CAGAACATCG TGGGGCACAG CTCTGATGAT TTTAGTGGCC ATTGGTITTA AAACCAAGCT 720



GGCTGCTTTG ACTCTTGTTG TGTGGCTCTT TGCCATCAAC GTATATTTCA ACGCCTTCTG 780 



GACCATTCCA GTCTACAAGC CCATGCATGA CTTCCTGAAA TACGACTTCT TCCAGACCAT 840 

GTCGGTGATT GGGGGCTTGC TCCTGGTGGT GGCCCTGGGC CCTGGGGGTG TCTCCATGGA 900 



TGAGAAGAAG AAGGAGTGGT AACAGTCACA GATCCCTACC TGCCTGGCTA AGACCCGTGG 960 
CCGTCAAGGA CTGGTTCGGG GTGGATTCAA CAAAACTGCC AGCTTTTATG TATCCTCTTC 1020



CCTTCCCCTC CCTTGGTAAA GGCACAGATG TTTTGAGAAC TTTATTTGCA GAGACACCTG 1080 



AGAATCAATG GCTTCAGGAC ATGGGTTCTC TTCTCCTGTG ATCATTCAAG TGCTCACTGC 1140 

ATGAAGACTG GCTTGTCTCA GTGTTTCAAC CTCACCAGGG CTGTCTCTTG GTCCACACCT 1200 



CGCTCCCTGT TAGTGCCGTA TGACAGCCCC CATCAAATGA CCTTGGCCAA GTCACGGTTT 1260 
CTCTGTGGTC AAGGTTGGTT GGCTGATTGG TGGAAAGTAG GGTGGACCAA AGGAGGCCAC 1320



GTGAGCAGTC AGCACCAGTT CTGCACCAGC AGCGCCTCCG TCCTAGTGGG TGTTCCTGTT 1380 



TCTCCTGGCC CTGGGTGGGC TAGGGCCTGA TTCGGGAAGA TGCCTTTGCA GGGAGGGGAG 1440 

GATAAGTGGG ATCTACCAAT TGATTCTGGC AAAACAATTT CTAAGATTTT 1TTGCTTTAT 1500 



GTGGGAAACA GATCTAAATC TCATTTTATG CTGTATTTTA TATCTTAGTT GTGTTTGAAA 1560 
ACGTTTTGAT TTTTGGAAAC ACATCAAAAT AAATAATGGC GTTTGTTGTA AAAAAAAAAA 1620

AAAAAAACTC GRGGGGGGGC CCGGTACCCA AATCGCC 1657



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 711 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

GGCACGAGCG AAGACCCTGT TCGGACCCTG CCCCGA1TCC AGACTCAGGT AGATCGTCGG 60 

CATACCCTCT ACCGTGGACA CCAGGCAGCC CTGGGGCTGA TGGAGAGAGA TCAGGTATCC 120 

CCCAGGGAGT AGGGGCTACC TTGAGGGGAT GATAGACCTC CCCCACTCCC AGTGKKACTC 180 

TGGAAATATG AAGGAACTAG GGAGTGGAAG AGATTTCAGA GCTGGGGAGA GGAGTTCCTC 240 

CCTTCAAAGC CAGCAACTGC CTTTGGGGAA TGTCGGGGGG TCTCTCCTTT CTCCTGCTTG 300

TTTRAGGTGG TACACAGTCC CCCCTTCAMC TGGSGGGAAG CTGTNCCGGA CARACTCATC 360 

TCAGCTTTCC CTTGGGGCAG GATCGGGGGC AGCAGCTCCA GCAGAAACAG CAGGATCTGG 420 

AGCAGGAAGG CCTCGAGGCC ACACAGGGGC TGCTGGCCGG CGAGTGGGCC CCACCCCTCT 480 

GGRAGCTGGG CAGCCTCTTC CAGGCCTTCG TGAAGAGGGA GAGCCAGGCT TATGCGTAAG 540 

CTTCATAGCT TCTGCTGGCC TGGGGTGGAC CCAGGACCCC TGGGGCCTGG GTGCCCTGAG 600

TGGTGGTAAA GTGGAGCAAT CCCTTCACGC TCCTTGGCCA TGTTCTGAGC GGCCAGCTTG 660 

GCCTTTGCCT TAATAAATGT GCTTTATTTT CAAAAAAAAA AAAAAAAAAC T 711

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

GGCACAGGGT GAAAGCCAGC TAAACCCCAA GTGGAGAAGT GAAAGACATG GTTGTTCCCA 60 

TAAGTTTATT GCTCACATTA TGAAAGAAGC CATAGTCATG AGTGAACCAC TCCCTAGCIT 120 
GATAAGGAAA CCAACACGGA AGATCTCTTT CTGGAAGAAG CAGCCAGCCT CGTGAAGGAG 180 



CGGCCCAGCC GCCGGGCCCG AGGGTCGCCT TTTGTTCGGA GTGGCACGAT TGTCCGTTCC 240

CAGACA1TCT CGCCTGGAGC ACGAAGCCAG TATGTTTGCA GACTTTATCG TAGTGACAGC 300

GACAGTTCAA CGCTGCCCCG GAAGTCCCCC TTTGTCCGAA ATACTTTGGA AAGACGAACC 360 

CTTCGCTATA AGCAGTCATG CAGGTCTTCC CTGGCTGAGC TCATGGCCCG CACCTCCCTG 420 

GACTTGGAGC TGGATCTCCA GGCGTCGAGA ACACGGCAGA GGCAGCTGAA TGAGGAGCTC 480 

TGCGCCCTCC GTGAGCTGCG GCAGCGGTTN GGAGGACGCC CAGCTCCGTG GCCAGACTGA 540

CCTCCCACCC TGGGTGCTTC GGGACGAGCG GCTCCGTGGC CTGCTGCGGG AGCCGAGCGG 600 

CAGACAAGAC AGACCAAACT TGACTACCGT CATGAGCAGG CGGCTGAGAA GATGCTGAAG 660 

AAGGCCTCCA AGGAGATCTA CCAGCTGCGT GGCAGAGCCA CAAAGAGCCC ATCCAAGTGC 720 

AGACCTTTAG GGAGAAGATA GCATTCTTCA CAAGGCCAAG GATCAACATA CCTCCTCTCC 780 

CAGCCGACGA CGTCTGATGG AGTGCATTGT GCACATGAAG TATTTATCCA CCTGTTTTAT 840

TTTCATGAAG TTCTTAGACT AGCTGAATTT GTCTTTAAAA TATITGTGCA AAGCTATTAA 900 

TATACACATT TTGTAAAAAA AAAAAAAAAA AAACT 935 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:

GCAGGGGCGA GGGGYTGGGG ACCGCGGGGC GGACGGGAGC GAGTATGTCC GCTCTGACTC 60 

GGCTGGCGTC TTTCGCTCGC GTTGGAGGCC GCCTTTTCAG AAGCGGCTGC GCACGGACTG 120 

CTGGAGATGG TGGAGTCCGT CATGCCGGTG GTGGTGTGCA CATTGAGCCC CGGTATAGAC 180 

AGTTCCCCCA GCTGACCAGA TCCCAGGTGT TCCAGAGCGA GTTCTTCAGC GGACTCATGT 240

GGTTCTGGAT TCTCTGGCGC TTTTGGCATG ACTCAGAAGA GGTGCTGGGT CACTTTCCGT 300 

ATCCTGATCC TTCCCAGTGG ACAGATGAAG AATTAGGTAT CCCTCCTGAT GATGAAGACT 360 

GAAGGTGTAG ACTCAGCCTC ACTCTGTACA AGAGCCAGGT GAGAATTTCA AGGATTATCG 420 

ACTTCATATT GCACATTAAA GTTACAAATT AAAGTGGCTT GGTCAAGAAT GARAAAAAAA 480 

AAAAAAAATT GGGQGGGGGC CCCN 504



(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

GAATTCGGCA CGAGGAGGAG GGGAGGCGGG GTAAGTTTGG TGGGAAACTC TGTAATTTCC 60

WTTTTTACTT TCACAGCAAT AGTGCAGAAT CCAGAATGGA TGTCCTCTTT GTAGCCATCT 120 

TTGCTGTGCC ACTTATCCTG GGACAAGAAT ATGAGGATGA AGAAAGACTG GGAGAGGATG 180

AATATTATCA GGTGGTCTAT TATTATACAG TCACCCCCAG TTATGATGAC TTTAGTGCAG 240 

ATTTCACCAT TGATTACTCC ATATTTGAGT CAGAGGACAG GCTGAACAGG TTGGATAAGG 300 

ACATAACAGA AGCAATAGAG ACTACCATTA GTCTTGAAAC AGCACGTGCA GACCATCCGA 360

AGCCTGTAAC TGTGAAACCA GTAACAACGG AACCTCAGAG TCCAGATCTG AACGATGCCG 420 

TGTCCAGTTT GCGAAGTCCT ATTCCCCTCC TCCTGTCGTG TGCCTTTGTT CAGGTGGGGA 480 

TGTATTTCAT GTAGAAGGTG GAAGAAGGCT GCTATGACTC TTTQGATGGG AGTCTGGCAA 540 

GAGGAAATTG GAAGATAAAA TAAATAATAA GTGAAATAAA AAAAAAAAAA AAAAACTCGA 600 

GGGGGGGCCC GGTACCCAAT 620



(2) INFORMATION FOR SEQ ID NO: 74:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 581 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

ACAAGGTGTG TGTAAAGTTT ATGTTTGTAA ACTGAATTCT ATCTTAAATC CAAAAAGAAC 60

TCGGGAGTAA TTCATTTTTG TAGCATAAAG ATCCCTAAGT TTTATTTTGA AATATCTGAT 120 

TTTTACACGT TAAAAAATAA CAGGGCATCG AGAGGATTCC TAGGTGACAT CCAGACTCCT 180 

TTAGCTTTGT GTGTGTGGCA CCGGTTAGTC TGCTTCTCTC TCCTTTCTTG CACTGCTTCA 240 

CACAGCCATG CCCTGCCAGC CCGGGCAGGT GCCTTCCTGT CAATGTACAT TTGGGCTTCT 300 

GCTCATGCTG CCCTCCCTCC CCTCCCCTGC CTCCCAACCC CGCCCCTTTT GTTCCTCCAT 360

GGAGTACTTC CATGGGTGTG CCTCCCCCAG CCAAGCCATA ATAGGTGGTT TCCCCTTCGC 420 

TTCTGTAGCC CTTGCAGACA TCCTCTGTTT ACACTAGGTG TTGACTTACT TCCCCTCTCC 480 



CCGSTAAAGC CATAAACTCC TTAAGGACAG GTAGCATTCT TAGTATCTTC GTTCTTCTCA 540
ATGACCAGTA GACCATTAAA CATGTAGCAA ACAAATGTGA A 581 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

AAACCCAACN CCCTCCGGTC CCCNAAAGAA AGCCCAGCCC AAATCCCAAG CCGGCAGTGA 60 

GCCCGCGAAC AAGGCCCTCA AGACGCCCAG NCGAACAAGC AGCCCCCAGG AGGCCCCGCA 120

AGAGAACTCC CTGGCGGCCC AAGCGGGCAG CTTCTGTGCG GCAGAACTCA GCCACCGAGA 180 

GCGCAGACAG CATCGAGATT TATGTCCCGG AGNCCCAGAC CAGGCTCTGA GACCATGCAG 240 

GAGGAAAGAA ACGATTTTAA ATCATTAAAA ACACAAAAAC TAAGTGCGAA CGGAACAGAG 300 

TTTTCTCAAC CTTTGCTATG GTTATTCTGT CTAGAGACCC TGAGCCAACT TTCAAATTGA 360 

CGCATACAAG GGCTCACAAT TTGGCTTTTT TGGGTCCCTC CCAGCTTTAG GTTTATGAAGA 420

TTTTACTCAC AAAAAAAATC AACAAAAATC ACGAAACTAG AAAACTTTTT TTTTCCTCTT 480 

GCTGGCCGTG GTGGACTAGA TAGATGGACG TCGGCAACTC CCGGCCCAGC CTCCATACTG 540 

CGGTCTTTTT ACTCGTTCTA TCTGATGAGA ACTCACACTA GCTTGTTTAC AAGATGACGA 600 

CAGTCCAAGG GCAGCCTTGG GCACCTGCCA TGTCCCTCCT TTCCCCAGCT ATCCCCGCTC 660 

TGACCTTGAT TTTCATTCTT ATGTTTTTCT CTTTTCCCTT CAGAGCTCAC ACAGTGGTCA 720

CCATTGTGGC AAGCGGCTTT CTGGGTCTCA GCCCTCTCTG CGGTTGAGGG CCCAGAGGAC 780 

AGAGAGATGG ACATGCGTCC CCTCCCTCCC CCCGCCAAGT GCTCACACAC AACCTCACGC 840 

GCACACACAC ACACGCAGAT GGAGGCGCCT CACTGGGAGG TGCCCCGCCA GCCCTGGGCA 900 

GTGTCAGGCA GGACTCACTC ACCGCTGAGC AGATGAGAGA AGTTTTAGTC TTGGCGGGTG 960 

GAAATGAGAC GAAGCCACAG TTATCACACT CCAGACTCCT GCCCTTTTAT TTTCTCCAGC 1020

CCCTTCTTCC TTCAGCAAAA TCTAGGACTC CCGAGTGGCT TCCAGGGGGC CGTCAGTCCT 1080 

CAGCCGCGCC TGTGTCCGGT GCCCGAGGGG CGGGCGGCGG TGTCTGTATG TATGTGTACA 1140 

TATGCACATA GACCTTAGAG TGTATAGTTA ACAAACGCCC ATCTGCTCAC CCATGCCCAC 1200 

CCAGCGCCGC CGCCGCTGGC TCTCGGQGCA CCTGGCAGGA GGCGGGTGTG TGAATAGCAT 1260 

ATATTTTTAC ATGTACTATA TCTAGGTGTG TGTACAAGTG TGTGTAAAAA TATATACCTT 1320



WO 98/42738 



PCT/US98/05311 




GTGTGTAAGC AGCCCTTTTT TTTTTTGGTC TCCACCCCCC TCCCCCCGCC CCGCACTCCT 1380 

AAGGGCCCAT CTGCCCAGCC TCTGAGTTTT CTGTTCTATT TTTTTTTTAA CCCCAATTAT 1440 

CCTTCTCTCT CTCCTGCCCC CGCATCCCAC TCCCAGGGTG TCACGAGCCC TGAGCTGCAA 1500 

TGGCCCQGGC CTGCAGGGCG GGGTAGGGGA QGGCARGGCT SAGCCCCGAA GCCAGCTCAG 1560 

TACCTGAGGG GCTGCTCTAT GCTGTGTATG CGCCTCTCTG GCATCCGAGA CATCCTCTTG 1620

GTGGCGCTTG CTNGCAGGGG ACCCCCCCCC CGTCCCCAGG TGAACCAAGG GTCTGCTCCG 1680 

GGGCCCATTT CCAGCTTGGC CGCCGTCTGT GACCTTGGGC AAGTCACTTG ACCTCTGTGT 1740 

GCCTCAACTT CCTCCTCTGT AAAACGGGGA CAGTCCCTGC CCCTCCCTAC CTCACAGGCA 1800 

TGTTGTGAGA ATAAATGAGG TAACGTGTAA AAAAAAAAAA AAT 1843 



(2) INFORMATION FOR SEQ ID NO: 76: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
TCGACCCACG CGTCCGGCTC CCCGAGCCCT GCCAACCATG GTGAACTTGG GTCTGTCCCG 60 
GGTGGACGAC GCCGTGGCTG CCAAGCACCC GGGACTCGGG GAGTATGCCG CATGCCAGTC 120
ACACGCCTTC ATGAAGGGCG TTTTCACCTT CGTCACAGGC ACCGGCATGG CCTTTGGCTT 180 
GCAGATGTTC ATTCAGAGGA AGTTTCCATA CCCTTTGCAG TGGAGCCTCC TAGTGGCCGT 240 

GGTTGCAGGC TCTGTGGTCA GCTACGGGGT GACGAGAGTG GAGTCGGAGA AATGCAACAA 300 
CCTCTGGCTC TTCCTGGAGA CCGGGCAGCT CCCCAAAGAC AGGAGCACAG ATCAGAGAAG 360 
CTAGGAGAGC TCCAGCAGGG GCACAGAGGA TTGGGGGCAG GAGGAGTCTG GAACACAGCC 420
TTCATGCCCC CTGACCCCAG GCCGACCCTC CCCACACCCT AGGGTACCCC AGTCGTATCC 480 
TCTGTCCGCA TGTKTGGCCA GGCCTGACAA ACACCTGCAG ATGGCTGCTG CCCCAACCTG 540 

GGACCTGCCC AGRAGGTTGG AGCAGAAAGG GCTCTCCCTG GGGTGGTGTT TCTCCTCTAG 600 
GGTATTGGGA TGCATGTTCT GCACTGCCAG CAGAGAGGGT GTGTCTGGGG GCCACCACCT 660 
ATGGGACACG GGGTCGAAGG GGCCTGTACA CTCTGTCATT TCCTTTCTAG CCCCTGCATC 720
TCCAACAAGT CCAAGGTGAC AGCTGGTGCT AGGGGCGTGG GGTTAATAAA TGGCTTATCC 780 
TTCTCTCCAC CCAAGTTTCC ACCTGACCAG GTGAAAAACA AATCAGAAGG GTAAGATGAT 840 

GACAGGTCAC ATGAAACCTT TATTACCCTA CAGTTGATAT ATGAGGATCA CATGCAAGTT 900



ACATACTGAG GATGTACAGG GAAGTTCCCA GCGCTGAACC CCAGAATTAG ACGTTCGCAT 960 



CAGCCCCGTA GGCCACGTGG ACACCACCAC AGCCTCTCTG TATGGGGGTC TGCCTCTGTA 1020



GCACTTGGCA TGTAGGGGCA GAGCAAAAGG GGCCANGCTG GCCAGAGCCT GGCTGCTGGG 1080 



NAGARGAGGG ACTTGTGGGS CACGCCACNT GCCTATCATT CCCCAYTCAT CTATTAGCCA 1140 

AAGTCACTCC CCAGAGGCAG AGCTAGCCCG TTGTAGCCGT GTCTGTGTGG AGGGAAAGCT 1200 



TCTGAGTGGG CAAGCCTACA CACAGCCCCG AGCCCCAAGA GGAGGAAGAG GTGGAGACCA 1260 



GACGGAACCT CCACAAGTCC ATCATGGTTA CAGCTGGCTT CCCCGCAGCA CCGAAGACCC 1320



ACAGCATNGG CCCTGCTGCC CCCGACCCAG CTCAGCTGCC ANGCCTCACC TTGCCAGGAA 1380 



TTGAAAGAAA GTTATTGAGT ACTAATTGGC CTCAGAGTNA CAGGAAGCTC AAGTTAAAGT 1440 

G 1441 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

GGCAGAGCTG GCCTTCGACT CGCTATGTCC ACTAACAATA TGTCGGACCC ACGGAGGCCG 60 
AACAAAGTGC TGAGGTGAGG ACCCCAGCGT CGTGGGCACG GGTTCGGGTT GTGGGTGTGG 120 
ATCGGGGCCC TGGGAAGCGC CTGTCTATCC CGGGGGCAGG ACCTGAGCGC CCCTGACCCT 180



CGAGCCTGTC GCAGGTACAA GCCCCCGCCG AGCGAATGTA ACCCGGCCTT GGACGACCCG 240 



ACGCCGGACT ACATGAACCT GCTGGGCATG ATCTTCAGCA TGTGCGGCCT CATGCTTAAG 300 

CTGAAGTGGT GTGCTTGGGT CGCTGTCTAC TGCTCCTTCA TCAGCTTTGC CAACTCTCGG 360 



AGCTCGGAGG ACACGAAGCA AATGATGAGT AGCTTCATGT GAGACTTGCC CTACAGAACA 420 
AGTGACTCTT GAGTAAGGGG TGGGGGGACC CCAGCCTGGC CATCCTAGAC TGACACCTCT 480



CTCCTGTCTT CATGCTGTCC ATCTCTGCCG TGGTGATGTC CTATCTGCAG AATCCTCAGC 540 



CCATGACGCC CCCATGGTGA TACCAGCCTA GAAGGGTCAC ATTTTGGACC CTGTCTATCC 600 

ACTAGGCCTG GGCTTTGGCT GCTAAACCTG CTGCCTTCAG CTGCCATCCT GGACTTCCCT 660 

GAATGAGGCC GTCTCGGTGC CCCCAGCTGG ATAGAGGGAA CCTGGCCCTT TCCTAGGGAA 720 



CACCCTAGGC TTACCCCTCC TGCCTCCCTT CCCCTGCCTG CTGCTGGGGG AGATGCTGTC 780

CATGTTTCTA GGGGTATTCA TTTGCTTTCT CGTTGAAACC TGTTGTTAAT AAAGTTTTTC 840

ACTCTGAAAA AAAAAAAAAA AAAAAAAAAC TYGRGGQGGG GCCCGGAACC CAATTCSCCG 900 

GATAGTGAGT 910

(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2776 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

TCGACCCACG CGTCCGGGCG GGCAGTGATG GCGGCTGGTG ATGGGGACGT GAAGCTAGGC 60 

ACCCTGGGGA GTGGCAGCGA GAGCAGCAAC GACGGCGGCA GCGAGAGTCC AGGCGACGCG 120 

GGAGCGGCAG CGRAAGGGGG AGGCTGGGCG GCGGCGGCGT TGGCGCTTCT GACGGGGGGC 180

GGGGAAATGC TGCTGAACGT GGCGCTGGTG GCTCTGGTGC TGCTGGQGGC CTACCGGCTG 240 

TGGGTGCGCT GGGGGCGGCG GGGTCTGGGG GCCGGGGCCG GGGCGGGCGA GGAGAGCCCC 300

GCCACCTCTC TGCCTCGCAT GAAGAAGCGG GACITCAGCT TGGAGCAGCT GCGCCAGTAC 360 

GACGGCTCCC GCAACCCGCG CATCCTGCTC GCGGTCAATG GGAAAGTCTT CGACGTGACC 420 

AAAGGCAGCA AGTTCTACGG CCCGGCGGGT CCATATGGAA TATTTGCTGG TAGGGATGCC 480

TCCAGAGGAC TGGCCACATT TTGCCTAGAT AAAGATGCAC TTAGAGATGA ATATGATGAT 540 

CTCTCAGATT TGAATGCAGT ACAAATGGAG AGTGTTCGAG AATGGGAAAT GCAGTTTAAA 600 

GAAAAATATG ATTATGTAGG CAGACTCCTA AAACCAGGAG AAGAACCATC AGAATATACA 660 

GATGAAGAAG ATACCAAGGA TCACAATAAA CAGGATTGAA CTTTGTAAAC AACCAAAGTC 720 

AGGGGCCTTC AGAACTGCAA TTCTTACTCC CTTTCACAGA CTGTCCGGAG TCTTTGGGTT 780

TGATTCACCT GCTGCGAAAA ACATTCAACA AATTGTGTAC AAGATAAATT AATCTCACTA 840 

TGAAGATTTG AATAACTAGA CATTATTTAT GCTGCCAAAC TCATTTGTTG CAGTTGTTTG 900

TAATGTCTAG TGGGGCTTCA TCATCCTGAA AAGAAGGAGA CAGGGATTTT TTTAAAGAGC 960 

AAGAAAGTCA CAATATTACT TCTTTCCTTC CTTTTTTCCT TCTTTCCTTT CTTCTTTCTC 1020 

TTTCTETCTT TTTAAAATAT ATTGAAGACA ACCAGATATG TATTTGCTAC TCAAGTGTAC 1080

AGATCTCCTC AAGAAACATC AAGGGACTCC TGTGTCACAT ACTGTGTTTT TATTTTAACA 1140 

TGGGTGAGGG AGGCGACCTG ATCAGGGGAG GTGGGGGTAC ACATCAATTT GAGTTGTTCA 1200 

GGCTACTGAA ACATTAAAAT GTGAATTCCC AAACTTTTCT TTTTGGCTTT GTCAGGGAAA 1260

AGAAAAATAT CTTTATAAAG AAATCTTTGG AAATTAQGAG AAGGAATTTC AGGTGGGTTT 1320 

AAGTCAGAGC TAGTTCCCCA ACAGAAAGAT CATTTGAAAC CAGTTTTTAT CCCTTCTCTT 1380

TCCTTCCCTT TCCCTAAATC AAATCAATAT TAATTGTGCC TTATTTCACT TAACATAGAC 1440 

TTGAATTATT TTTAGGGAAA GCCCCTATAA TGAATTCAGA AATCACTACA AGCAGCATTA 1500 

AGACTGAAGT TGGAATATTC TGTTGACCAT AAAACCTTGA TATCATTCTG TGTATATAGA 1560 

ATGTAAAAGG AATATTACAG TCTTAACTGC CATATATGTA ATATACACAA ACTCAATTAG 1620 

CATTGTAATG GCCAAATGCA TTCCCCCATG CTTTTCTGTT TTCAAAAAAA TTGAAAAACA 1680

AATCAACTCT TATCCCCAAC AGCTGCCTAA TTTTAGGAGT CTGACCCTCC ACATCTCACT 1740 

GGTGTGGGTG CATGGGGCTG TGGAGTGGGT GTCAGTATGG ATGTGTCTGA ATGTGTGAGG 1800 

CCTTGGAAGG GACTCTTTCT GCAGATACTG TAAATACAAG TACCATTTTA ATAAAGCATG 1860 

TACAATAAAC CAAAATAAGC TTGAGTTGGA CTTTATATAC AGAACTGTAA GCCAGTGCAT 1920 

TATGATACAG TTGTAAGATT GTGCATTTGA TTCAAGATAA GGAAAAATCT TGGAAATGAA 1980

AAGCAGGCAC KGGTTAACCA AGTTGTACAC ATTGTACCAC ATTCAGCATA ACTTTAGGAA 2040 

GAAATTCCAC TTTGTGAACA TTCTCCAGAA ATCCAAGATT ATTCAGGTAA GAATTGGTAT 2100 

ATTAAATGTA CATCTTTTTA CTTTCTATTT TGATGCCAAC TGATTATACT AGACAATTAG 2160 

CACTCCAGGT GGTTATTGAA CACAAAACAG TAAAAGAATA TTGCACTGAT AGATACTAAA 2220 

TTATTATTTT ATTAGGTTGA AAAAGCCCTT ACTAAAAGCC CCTCATATAT CAATTACTTT 2280

ATTTCATTAT GACTACTTAG GTTCCGGGCT GGGGACAAGT TCACTTAAAA AGGCAATGTT 2340 

ATTTAACAGG TCACCAGTTA AGACTTCTGC TTTGTAGATA CATGCAGAAG CCATCAAACA 2400 

AGGGGGRGCT TTTAACTGCA ACAATAAGCT AAAGTATGTA AAATACTACA TTCTATTCAG 2460 

TCTTGGAGTG TTTTGTAGAA AGTTATCTTC AGCCAAATCT TTGCTGAAGA CTGGTTGTGG 2520 

AGTGTTGGTA AATGCTTTGT GTTTTTATGT AAAATATTTT CTAAACAAAA AATGTTAAAA 2580

GTACATGTCC TCTGTAGTAA ACTGATATCT ATATATATGA ATCATTCAAG CCTAAAGTCT 2640 

AGTAATAAAC TGTACTTGTG AATAGAGAAA CCCTAAATAT TCATGCAGWA AAAATTATGC 2700 

GGTCTGTTAA GAAAAATGAG TAATTTGTGT TTTGGACTTG AAATAAACAG TGTTCTGTAG 2760 

ATAATTCCTC AAdTC 2776 

(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1525 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

CCGCTGCTGA TAACTATGGC ATCCCCCGGG CCTGCAGGAA TTCGGCACGG AGCTACGGCG 60

CCGCCTGGCT CCTGCTGNCA CCTGCAGGCT CGTCGCGGGT GGAGCCCACC CAAGACATCA 120

GCATCAGCGA CCAGCTGGGG GGCCAGGACG TGCCCGTGTT CCGGAACCTG TCCCTGCTGG 180 

TGGTGGGTGT CGGCGCCGTG TTCTCACTGC TATTCCACCT GGGCACCCGG GAGAGGCGCC 240 

GGCCGCATGC GGASGAGCCA GGCGAGCACA CCCCCCTGTT GGCCCCTGCC ACGGCCCAGC 300 

CCCTGCTGCT CTGGAAGCAC TGGCTCCGGG AGCSGGCTTT CTACCAGGTG GGCATACTGT 360 

ACATGACCAC CAGGCTCATC GTGAACCTGT CCCAGACCTA CATGGCCATG TACCTCACCT 420

ACTCGCTCCA CCTGCCCAAG AAGTTCATCG CGACCATTCC CCTGGTGATG TACCTCAGCG 480 

GCTTCTTGTC CTCCTTCCTC ATGAAGCCCA TCAACAAGTG CATTGGGAGG AACATGACCT 540 

ACTTCTCAGG CCTCCTGGTG ATCCTGGCCT TTGCCGCCTG GGTGGCGCTG GCGGAGGGAC 600 

TGGGTGTGGC CGTGTACGCA GCGGCTGTGC TGCTGGGTGC TGGCTGTGCC ACCATCCTCG 660 

TCACCTCGCT GGCCATGACG GCCGACCTCA TCGGTCCCCA CACGAACAGC GGACTKTCGT 720

GTACGGCTCC ATGAGCTTCT TGGATAAGGT GGCCAATGGG CTGGCAGTCA TGGCCATCCA 780 

GAGCCTGCAC CCTTGCCCCT CAGAGCTCTG CTGCAGGGCC TGCGTGAGCT TTTACCACTG 840 

GGCGATGGTG GCTGTGACGG GCGGCGTGGG CGTGGCCGCT GCCCTGTGTC TCTGTAGCCT 900 

CCTGCTGTGG CCGACCCGCC TGCGACGCTG GGACCGTGAT GCCCGGCCCT GACTCCTGAC 960 

AGCCTCCTGC ACCTGTGCAA GGGAACTGTG GGGACGCACG AGGATGCCCC CCARGGCCTT 1020

GGGGAAAAGC CCCCACTGCC CCTCACTCTT CTCTGGACCC CCACCCTCCA TCCTCACCCA 1080 

GCTCCCGGGG GTGGGGTCGG GTGAGGGCAG CAGGGATGCC CGCCAGGGAC TTGCAAGGAC 1140 

CCCCTGGGTT TTGAGGGTGT CCCATTCTCA ACTCTAATCC ATCCCAGCCC TCTGGAGGAT 1200 

TTGGGGTGCC CCTCTCGGCA GGGAACAGGA AGTAGGAATC CCAGAAGGGT CTGGGGGAAC 1260 

CCTAACCCTG AGCTCAGTCC AGTTCACCCC TCACCTCCAG CCTGGGGGTC TCCAGACACT 1320

GCCAGGGCCC CCTCAGGACG GCTGGAGCCT GGAGGAGACA GCCACGGGGT GGTGGGCTGG 1380 

GCCTGGACCC CACCGTGGTG GGCAGCAGGG CTGCCCGGCA GGCTTGGTGG ACTCTGCTGG 1440 

CAGCAAATAA AGAGATGACG GCAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1500 

AAAAAAAAAA AAACCCACCG TCCGC 1525 



(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

AATTCGGCAC GAGNCAGAAA CCTGCGGAAA ATGGTAQCGA TGGCGGCTGG GCCGAGTGGG 60 

TGTCTGGTQC CGGCGTTTGG GCTACGGTTG 1TGTTGGCGA CTGTGCTTCA AGCGGTGTCT 120

GCTTTTGQGG CAGAGTTTTC ATCQGAGGCA TGCAGAGAGT TAGGCTTTTC TAGCAACTTG 180 

CTTTGCAGCT CTTGTGATCT TCTCGGACAG ITCAACCTGC TTCAGCTGGA TCCTGATTGC 240 

AGAGGATGCT GTCAGGAGGA AGCACAATTT GAAACCAAAA AGCTGTATGC AGGAGCTATT 300 

CTTGAAGTTT GTGGATGAAA ATTQGGAAGG TTCCCTCAAG TCCAAGCTTT TGTTAGGAGT 360 

GATAAACCCA AACTGTTCAG AGGACTGCAA ATCAAGTATG TCCGTGGTTC AGACCCTGTA 420

TTAAAGCTTT TGGACGACAA TGGGAACATT GCTGAAGAAC TGAGCATTCT CAAATGGAAC 480 

ACAGACAGTG TAGAAGAATT CCTGAGTGAA AAGTTGGAAC GCATATAAAT CTTGCTTAAA 540 

TTTTGTCCTA TCCTTTTGTT ACCTTATCAA ATGAAATATT ACAGCACCTA GAAAATAATT 600 

TAGTTTTGCT TGCTTCCATT GATCAGTCTT TTACTTGAGG CATTAAATAT CTAATTAAAT 660 

CGTGAAATGG CAGTATAGTC CATGATATCT AAGGAGTTGG CAAGCTTAAC AAAACCCATT 720

TTTTATAAAT GTCCATCCTC CTGCATTTGT TGATACCACT AACAAAATGC TTTGTAACAG 780 

ACTTGCGGTT AATTATGCAA ATGATAGTTT GTGATAATTG GTCCAGTTTT ACGAACAACA 840 

GATTTCTAAA TTAGAGAGGT TAACAAGACA GATGATTACT ATGCCTCATG TGCTGTGTGC 900 

TCTTTGAAAG GAATGACAGC AGACTACAAA GCAAATAAGA TATACTGAGC CTCAACAGAT 960 

TGCCTGCTCC TCAGAGTCTC TCCTATTTTT GTTA1TACCCA GCTTTCTTTT TAATACAAAT 1020

GTTATTTATA GTTTACAATG AATGCACTGC ATAAAAACTT TGTAGCTTCA TTATTGTAAA 1080 

ACATATTCAA GATCCTACAG TAAGAGTGAA ACATTCACAA AGATTTGCGT TAATGAAGAC 1140 

TACACAGAAA ACCTTTCTAG GGATTTGTGT GGATCAGATA CATACTTGGC AAATTTTTGA 1200 

GTTTTACATT CTTACAGAAA AGTCCATTTA AAAGTGATCA TTTGTAAGAC CAAAATATAA 1260 

ATAAAAAGTT TCAAAAATCT ATCTGAATTT GGAATTCTTC TGGTTTGTTC TTTCATGTTT 1320

AAAAATGATG TTTTTCAATG CATTTTTTTC ATGTAAGCCC TTTTTTTAGC CAAAATGTAA 1380 

AAATGGCTGT AATATTTAAA ACTTATAACA TCTTATTGTT GGTAATAGTG CTTTATATTT 1440 

GTCTGATTTT ATTTTTCAAA GTTTETTCAT TTATGAACAC ATTTTCATTG GTATATTATT 1500

TAAGGAATAT CTCTTGATAT AGAATTTTTA TATTAAAAAT GATTTTTCTT TGCTTAAAAA 1560

AAA 1563

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
TGCACGCTGG CCATGTGGGN GTTGGGCCAC TGCGACCCCC GGCGCTGCAC GGGCCGCAAG 
TTGGCCCGCC TGGGGCTGGT GCGCTGCCTG CGCCTGGGCC ACAGATTCGG CGGTCTGGTG
CTGAGCCCCG TGGGCAAGCA GTACGCGTCC CCCGCAGACA GACAGCTGGT GGCGCAGTCT 
GGGGTCGCCG TCATCGACTG CTCCTGGGCC AGGCTGGACG AGACACCGTT TGGGAAGATG 
CGAGGGAGCC ACTTGCGCCT GTTGCCCTAC CTGGTGGCCG CCAACCCCGT GAACTATGGC 
CGGCCCTACA GACTTTCCTG CGTGGAAGCG TTTGCTGCCA CCTTCTGCAT CGTAGGCTTT 
CCAGACCTTG CTGTCATTTT GCTGCGGAAG TTTAAATGGG GCAAGGGCTT CTTGGACCTG 
AACCGCCAGC TCCTGGACAA GTACGCGGCC TGCGGCAGCC CGGAGGAGGT GCTGCAGGCG 
GAGCAGGAGT TCTTGGCCAA TGCCAAGGAG AGCCCCCAGG AGGAGGAGAT CGATCCCTTC 
GATGTGGATT CAGGGAGAGA GTTTGGAAAC CCCAACAGGC CTGTGGCCAG CACCCGGCTG 
CCCTCGGACA CTGATGACAG TGATGCGTCT GAGGACCCAG GGCCTKGCGC CGAGCGCGGA 
GGAGCCAGCA GCAGCTGCTG TGAAGAGGAG CAGACGCAGG GACGGGGGGC TGAGGCCAGG 
GCCCCGGCTG AGGTTTGGAA AGGAATCAAG AAACGGCAGA GAGACTGAGG GTTGCAGACA 
CATATATTTT TGAGGCTGGG TGACGAGAAA ATCTAGAGAC ATGAGGGACA TAAATGGGCC 
TGGCAGCCTC GGCTCTTTGC GGCTGCTGGC AGGACTGAGC TGTCCGGGTT CTCCCCACAC 
TTCCAGCACA GCTGTGCTCT GTGTCCTGCC TCGGCGCTCT CGCAAATGAA GCTQCAGGCC 
AAGAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAG GGGGGGGGGC 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 770 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

TCGACCCACG CGTCCGGGCC GCCGTAGCGC GTCTTGGGTC TCCCGGCTGC CGCTGCTGCC 60 

GCCGCCGCCT CGGGTCGTGG AGCCAGGAGC GACCTCACCG CCATGGCAGG CATCAAAGCT 120 

10 

TTGATTAGTT TGTCCTTTGG AGGAGCAATC GGACTGATGT TTTTRATGCT TGGATGTGCC 180 

CTTCCAATAT ACAACAAATA CTGGCCCCTC TTTGTTCTAT TTTTTTACAT CCTTTCACCT 240 

15 ATTCCATACT GCATAGCAAG AAGATTAGTG GATGATACAG ATGCTATGAG TAACGCTTGT 300 

AAGGAACTTG CCATCTITCT TACAACGGGC ATTGTCGTGT CAGCTTTTGG ACTCCCTATT 360 

GTATTTGCCA GAGCACATCT GATTGAGTGG GGAGCTTGTG CACTTGTTCT CACAGGAAAC 420 

20 

ACAGTCATCT TTGCAACTAT ACTAGGCTTT TTCTTGGTCT TTGGAAGCAA TGACGACTTC 480 

AGCTGGCAGC AGTGGTGAAA AGAAATTACT GAACTATTGT CAAATGGACT TCCTGTCATT 540 

25 TGTTGGCCAT TCACGCACAC AGGAGATGGG GCAGTTAATG CTGAATQGTA TAGCAAGCCT 600 

CTTGGGGGTA TTTTAGGTGC TCCCTTCTCA CTTTTATTGT AAGCATACTA TTTTCACAGA 660 

GACTTGCTGA AGGATTAAAA GGATTTTCTC TTTTGGAAAA AAAAAAAAAA AAAAACYCGA 720 

30 

GGGGGGGCCC GTWCCCATTC SCCCYATATG AATTCCNTTT TTACAATCCC 770 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

45 

GAATTCGGCA CGAGCATAGT GTTAACCACT AGAATTCACT GCCCTTCCTA TCCAAAAATG 60 

ACACTACTGA TCATrTTTCT TCCTTTTSCT TTTACAACAT TMACAAATTC AGGTGGCTCT 120 

50 TTCCCAGTAC GGTAGGCTGA TTCGTATGGA TGCACCACGG TTGGTGACTC CCCCCACCCC 180 

ACAGAGTTTC TGGCGTTCAT TCGGTTGAAC CCAAGGCCAG CAAGGGCTGA CTGGGAACAA 240 

ACCGAACACT AGGCCGTGAA CCAATCGTCT CTCCGTGCCC GGGAGCGAMC CCGGGQGCCT 300 

55 

TTCACTCTCC CAAGGACTCC ANGGGGGGGC CGGGTACCCA ATTCCGCCCC TATAGTGAAT 360 

CCGTNATTAC AATTCCACNT GGGCCGTCCN TTTTTACAAA CGTTCCGTTG AACTGGGAAA 420 

AACCCCTTGG CGGTTTACCC CAACTTTAAT CCGCCTTTGC AAGCACATCC CCCCCCTTTT 480

(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 644 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

GCTGGGATAG AGCATGAAAG GAGAACTGCT CCCTTTTCTG TTTCTCACAG TTTGGTTATG 60 

GCTTTATAAA CTTKTATTTG GTGAAAGCCC CAGATACCCA AATGTCATTG GCAAAACTTA 120 

20 

TTTTTTTTTC TGGACAGATC AGATTTCTAG AGAGAGCAGA TTTCTAGAGA GATTAGCATT 180 

CATAGTAAGT GAAAATTGTC TAATTTTTTT AATCCATGCT ATTACTGGGC AGTAGGTCTA 240 

25 A'rrri'lTTTG ACAAAAAATA GATCTATTTT CCTTATATAT TGATTTAGAA TCTTAAGTTA 300 

GAATTTTATA GAAGAAATGT CTGAGCAGTT CTATGTATGG AGGAGCAATT CAGCTTTTCA 360 

GCAGCAACTT TATCTTTTGC CACTAGAGGG AGATCTGTGG TTGCTTTCTC CTTTGGAGAA 420 

30 

TAGCTGCTTT GCTTTTATTT TTAATTTCTA AGGTTGGAAT AGAACTTATT CTCAAAATTC 480 

CTTTAGTGTT ATTAAATATT TTCATTTATT AGTCAAAGGT AAGTTAATTA AGCTTGTTTA 540 

35 ATGATGCCAA TCTTATGCTT TTCTGTAATC TTCAATTTTT AATAAATGTG AGTTAGATAC 600 

TAAGTGAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 644 




(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:

GGCACGAGTG CGCASGCGTG GGGCTCTCTC CTTGTCAGTC GGCGCCGCGT GCGGGCTGGT 60 

GGCTCTGTGG CAGCGGCGGC GGCAGGACTC CGGCACTATG AGCGGCTTCA GCACCGAGGA 120 

55 

GCGCGCCGCG CCNTTCTCCC TGGAGTACCG AGTCTTCCTC AAAAATGAGA AAGGACAATA 180 

TATATCTCCA TTTCATGATA TTCCAATTTA TGCAGATAAG GATGTGTTTC ACATGGTAGT 240 

TGAAGTACCA CGCTGGTCTA ATGCAAAAAT GGAGATTGCT ACAAAGGACC CTTTAAACCC 300

TATTAAACAA GATGTGAAAA AAGGAAAACT TCGCTATGTT GCGAATTTGT TCCCGTATAA 360

AGGATATATC TGGAACTATG GTGCCATCCC TCAGACTTGG GAAGACCCAG GGCACAATGA 420

TAAACATACT GGCTGTTGTG GTGACAATGA CCCAATTGAT GTGTGTGAAA TTGGAAGCAA 480

GGTATGTGCA AGAGGTGAAA TAATTGGCGT GAAAGTTCTA GGCATATTGG CTATGATTGA 540

CGAAGGGGAA ACOGACTGGA AAGTCA1TGC CATTAATGTG GATGATCCTG ATGCAGCCAA 600

TTATAATGAT ATCAATGATG TCAAACGGCT GAAACCTGGC TACTTAGAAG CTACTGTGGA 660

CTGGTTTAGA AGGTATAAGG TTCCTGATGG AAAACCAGAA AATGAGTTTG CGTTTAATGC 720

AGAATTTAAA GATAAGGACT TTGCCATTGA TATTATTAAA AGCACTCATG ACCATTGGAA 780

AGCATTAGTG ACTAAGAAAA CGAATGGAAA AGGAATCAGT TGCATGAATA CAACTTTGTC 840

TGAGAGCCCC TTCAAGTGTG ATCCTGATGC TGCCAGAGCC ATTGTGGATG CTTTACCACC 900

ACCCTGTGAA TCTGCCTGCA CAGTACCAAC AGACGTGGAT AAGTGGTTCC ATCACCAGAA 960

AAACTAATGA GATTTCTCTG GAATACAAGC TGATATTGCT ACATCGTGTT CATCTGGATG 1020

TATTAGAAGT AAAAGTAGTA GCTTTTCAAA GCTTTAAATT TGTAGAACTC ATCTAACTAA 1080

AGTAAATTCT GCTGTGACTA ATCCAATATA CTCAGAATGT TATCCATCTA AAGCATTTTT 1140

CATATCTCAA CTAAGATAAC TTTTAGCACA TGCTTAAATA TCAAAGCAGT TGTCATTTGG 1200

AAGTCACTTG TGAATAGATG TGCAAGGGGA GCACATATTG GATGTATATG TTACCATATG 1260

TTAGGAAATA AAATTATTTT GCTGAAAAAA AAAAAAAAAA AACCNCGGGG GGGGCCCCGG 1320

TCCCCATTTG GCCCTTTGGG GGGNGGTTTT A 1351

(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2527 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

CTCTTGCTAC CTTCCCGGCG CAGAGAACCC CGGCTGCTCA GCGCGCTCCG GGGTCATGGA 60 

GATCCCCGGG AGCCTGTGCA AGAAAGTCAA GCTGAGCAAT AACGCGCAGA ACTGGGGAAT 120 

55 GCAGAGAGCA ACCAATGTCA CCTACCAAGC CCATCATGTC AGCAGGAACA AGAGAGGTCA 180 

GGTGGTGGGG ACCAGAGGTG GCTTTCGTGG TTGCACAGTT TGGCTAACAG GCTTGTCTGG 240 

AGCGGGAAAG ACTACTGTGA GCATGGCCTT GGAGGAGTAC CTGGTTTGTC ATGGTATTCC 300 

ATGCTACACT CTGGATGGTG ACAATATTCG TCAAGGTCTC AATAAAAATC TTGGCTTTAG 360 

TCCTGAAGAC AGAGAAGAGA ATGTTCGACG CATCGCAGAA GTTGCTAAAC TGTTTGCAGA 420 

TGCTGGCTTA GTGTGCATCA CAAGTTTCAT ATCACCTTAC ACTCAGGATC GCAACAATGC 480 

AAGGCAAATT CATGAAGGTG CAAGTTTACC GmTTTGAA GTATTTGTTG ATGCTCCTCT 540 

GCATGTTTGT GAACAGAGGG ATGTCAAAGG ACTCTACAAA AAAGCCCGGG CAGGAGAAAT 600 

TAAAGGTTTC ACTGGGATCG ATTCTGAATA TGAAAAGCCA GAGGCCCCTG AGTTGGTGCT 660 

GAAAACAGAC TCCTGTGATG TAAATGACTG TGTCCAGCAA GTTGTGGAAC TTCTACAGGA 720 

ACGGGATATT GTACXTTGTGG ATGCATCTTA TGAAGTAAAA GAACTATATG TGCCAGAAAA 780 

TAAACTTCAT TTGGCAAAAA CAGATGCGGA AACATTACCA GCACTGAAAA TTAATAAAGT 840 

GGATATGCAG TGGGTGCAGG TTTTGGCAGA AGGTTGGGCA ACCCCATTGA ATGGCTTTAT 900 

GAGAGAGAGG GAGTACTTGC AGTGCCTTCA TTTTGATTGT CTTCTGGATG GAGGTGTCAT 960 

TAACTTGTCA GTACCTATAG TTCTGACTGC GACTCATGAA GATAAAGAGA GGCTGGACGG 1020 

CTGTACAGCA TTTGCTCTGA TGTATGAGGG CCGCCGTGTG GCCATTCTTC GCAATCCAGA 1080 

GTTTTTTGAG CACAGGAAAG AGGAGCGCTG TGCCAGACAG TGGGGAACGA CATGCAAGAA 1140 

CCACCCCTAT ATTAAGATGG TGATGGAACA AGGAGATTGG CTGATTGGAG GAGATCTTCA 1200 

AGTCTTGGAT CGAGTTTATT GGAATGATGG TCTTGATCAG TATCGTCTTA CTCCTACTGA 1260 

GCTAAAGCAG AAATTTAAAG ATATGAATGC TGATGCTGTC TTTGCATTTC AACTACGCAA 1320 

CCCAGTGCAC AATGGACATG CCCTGTTAAT GCAGGATACC CATAAGCAAC TTCTAGAGAG 1380 

GGGCTACCGG CXXrCCTGTCC TCCTCCTCCA CCCTCTGGGT GGCTGGACAA AGGATGACGA 1440 

TGTTCCTTTG ATGTGGCGTA TGAAGCAGCA TGCTGCAGTG TTGGAGGAAG GAGTTCTGAA 1500 

TCCTGAGACG ACAGTGGTGG CCATCTTCCC ATCTCCCATG ATGTATGCTG GACCAACTGA 1560 

GGTCCAGTGG CATTGCAGAG CACGGATGGT TGCAGGAGCC AACTTTTACA TTGTTGGACG 1620 

AGACCCTGCT GGCATGCCTC ATCCAGAAAC AGGGAAGGAT CTTTATGAGC CAAGTCATGG 1680 

TGCCAAAGTG CTGACGATGG CCCCTGGTTT AATCACTTTG GAAATAGTTC CCTTTCGAGT 1740 

TGCAGCTTAC AACAAGAAAA AGAAGCGTAT GGACTACTAT GACTCTGAAC ACCATGAAGA 1800 

CTTTGAATTT ATTTCAGGAA CACGAATGCG CAAACTTGCT CGAGAAGGCC AGAAACCACC 1860

TGAAGGTTTC ATGGCTCCCA AGGCTTGGAC CGTGCTGACA GAATACTACA AATCCTTGGA 1920 

GAAAGCTTAG GCTGTTAACC CAGTCACTCC ACCTTTGACA CATTACTAGT AACAAGAGGG 1980 

GACCACATAG TCTCTGTTGG CATTTCTTTG TGGTGTCTGT CTGGACATGC TTCCTAAAAA 2040 

CAGACCATTT TCCTTAACTT GCATCAGTTT TGGTCTGCCT TATGAG1TCT GTTTTGAACA 2100

AGTGTAACAC ACTGATGGTT TTAATGTATC TTTTCCACTT ATTATAGTTA TATTCCTACA 2160

ATACAATTTT AAAATTGTCT TTTTATATTA TATTTATGCT TCTGTGTCAT GATTTTTTCA 2220

AGCTGTTATA TTAGTTGTAA CCAGTAGTAT TCACATTAAA TCTTGCTTTT TTTCCCCTTA 2280

AAAAAAGAAA AAAATTACCA AACAATAAAC TTGGCTAGAC CTTGTTTTGA GGATTTTACA 2340

AGACCTTTGT AGCGATTAGA TTTTTTTTCT ACATTGAAAA TAGAAACTGC TTCCTTTCTT 2400

CTTTCCAGTC AGCTATTGGT CTTTCCAGCT GTTATAATCT AAAGTATTCT TATGATCTGT 2460

GTAAGCTCTG AATGAACTTC TTTACTCAAT AAAATTAATT TOTTGGCTTC TTAAAAAAAA 2520

AAAAAAA 2527



(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2566 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

CCCAAGAATT CGGCACGAGC GNGGCAWAAK TGGGATTTCT GAAACCTGTA GGCCCCAAGC 60

CCATCAACTT GCCCAAAGAA GATTCCAAAC CTACATTTCC CTGGCCTSCT GGAAACAAGC 120 

CATCTCTTCA CAGTGTAAAC CAAGACCATG ACTTAAAGCC ACTAGGCCGA AATCTGGGCC 180 

35 

TACTCCTCCA ACCTCAGAAA ATGAACAGAA GCAAGCKTTT CCCAAATTGA CTGGGGTTAA 240 

AGGGAAATTT ATGTCAGCAT CACAAGATCT TGAACCCAAG CCCCTCTTCC CCAAACCOGC 300 

40 CTTTGGCCAG AAGCCGCCCC TAAGTACCGA GAACTCCCAT GAAGACGAAA GCCCCATGAA 360 

GAATGTGTCT TCATCAAAAG GGTCCCCAGC TCCCCTGGGA GTCAGGTCCA AAAGCGGCCC 420 

TTTAAAACCA GCAAGGGAAG ACTCAGAAAA TAAAGACCAT GCAGGGGAGA TTTCAAGTTT 480 

45 

GCCCTTTCCT GGAGTGGTTT TGAAACCTGC TGCGAGCAGG GGAGGCCCAG GTCTCTCCAA 540 

AAATGGTGAA GAAAAAAAGG AAGATAGGAA GATAGATGCT GCTAAGAACA CCTTCCAGAG 600 

50 CAAAATAAAT CAGGAAGAGT TGGCCTCAGG GACTCCTCCT GCCAGGTTCC CTAAGGCCCC 660 

TTCTAAGCTG ACAGTGGGGG GGCCATGGGG CCAAAGTCAG GAAAAGGAAA AGGGAGACAA 720 

GAATTCAGCC ACCCCGAAAC AGAAGCCATT GCCTCCCITG TTTACCTTGG GTCCACCTCC 780

ACCAAAACCC AACAGACCAC CAAATGTTGA CCTGACGAAA TTCCACAAAA CCTCTTCTGG 840 

AAACAGTACT AGCAAAGGCC AGACGTCTTA CTCAACAACT TCCCTGCCAC CACCTCCACC 900 

ATCCCATCCG GCCAGCCAAC CACCATTGCC AGCATCTCAC CCATCACAAC CACCAGTCCC 960

AAGCCTACCT CCCAGAAACA TTAAACCTCC GTTTGACCTA AAAAGCCCTG TCAATGAAGA 1020

CAATCAAGAT GGTGTCACGC ACTCTGATGG TGCTGGAAAT CTAGATGAGG AACAAGACAG 1080

TGAAQGAGAA ACATATGAAG ACATAGAAGC ATCCAAAGAA AGAGAGAAGA AAAGGGAAAA 1140 

GGAAGAAAAG AAGAGGTTAG AGCTGGAGAA AAAGGAACAG AAAGAGAAAG AAAAGAAAGA 1200 

10 ACAAGAAATA AAGAAGAAAT TTAAACTAAC AGGCCCTATT CAAGTCATCC ATCTTGCAAA 1260 

AGCTTGTTGT GATGTCAAAG GAGGAAAGAA TGAACTGAGC TTCAAGCAAG GAGAGCAAAT 1320 

TGAAATCATC CGCATCACAG ACAACCCAGA AGGAAAATGG TTGGGCAGAA CAGCAAGGGG 1380 

15 

TTCATATGGC TATATTAAAA CAACTGCTGT AGAGATTGAC TATGATTCTT TGAAACTGAA 1440 

AAAAGACTCT CTTGGTGCCC CTTCAAGACC TATTGAAGAT GACCAAGAAG TATATGATGA 1500 

20 TGTTGCAGAG CAGGATGATA TTAGCAGCCA CAGTCAGAGT GGAAGTGGAG GGATATTCCC 1560 

TCCACCACCA GATGATGACA TTTATGATGG GATTGAAGAG GAAGATGCTG ATGATGGCTC 1620 

CACACTACAG GTTCAAGAGA AGAGTAATAC GTGGTCCTGG GGGATTTTGA AGATGTTAAA 1680 

25 

GGGAAAAGAT GACAGAAAGA AAAGTATACG AGAGAAACCT AAAGTCTCTG ACTCAGACAA 1740 

TAATGAAGGT TCATCTTTCC CTGCTCCTCC TAAACAATTG GACATGGGAG ATGAAGTTTA 1800 

30 CGATGATGTG GATACCTCTG ATTTCCCTGT TTCATCAGCA GAGATGAGTC AAGGAACTAA 1860 

TGTTGGAAAA GCTAAGACAG AAGAAAAGGA CCTTAAGAAG CTAAAAAAGC AGRAAAAARA 1920 

ARAAAAAGAC TTCAGGAAAA AATTTAAATA TGATGGTGAA ATTAGAGTCC TATATTCAAC 1980 

35 

TAAAGTTACA ACTTCCATAA CTTCTAAAAA GTGGGGAACC AGAGATCTAC AGGTAAAACC 2040 

TGGTGAATCT CTAGAAGTTA TACAAACCAC AGATGACACA AAAGTTCTCT GCAGAAATGA 2100 

40 AGAAGGGAAA TATGGTTATG TCCTTCGGAG TTACCTAGCG GACAATGATG GAGAGATCTA 2160 

TGATGATATT GCTGATGGCT GCATCTATGA CAATGACTAG CACTCAACTT TGGTCATTCT 2220 

GCTGTGTTCA TTAGGTGCCA ATGTGAAGTC TGGATTTTAA TTGGCATGTT ATTGGGTATC 2280 

45 

AAGAAAATTA ATGCACAAAA CCACTTATTA TCATTTGTTA TGAAATCCCA ATTATCTTTA 2340 

CAAAGTGTTT AAAGTTTGAA CATAGAAAAT AATCTCTCTG CTTAATTGTT ATCTCAGAAG 2400 

50 ACTACATTAG TGAGATGTAA GAATTATTAA ATATTCCATT TCCGCTTTGG CTACAATTAT 2460 

GAAGAAGTTG AAGGTACTTC TTTTAGACCA CCAGTAAATA ATCCTCCTTC AAAAAATAAA 2520 

AATAAAAAAA AAAAAAAAAA ACTCGAGGGG GGGCCCGGTA CCCAAT 2566




(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

GAATTCGGCA CGAGGCTTTC TGTGTCCTCT GTGGCTGCTT TAGTGTGCCA CCAGGGGCAG 60 

ACTTGGGTGG GTTGCAGCAG AGATGGCATG GCCCTCAAGG TCCAAGATGT 1TACTCTCTT 120 

GCCGGTCCTC TGTTATCTCT GGTdTTGTG GTTGCCACAG TTTTCTTGGA TCCAGGAGTT 180 

15 AAAGGCAGTC CTGAGGGATG ATGGCCTCAT CTCCGCAGTT GCYTGGAATG CTGAATTTCA 240 

GACGTGCTAA AGGAGGGTTG CAGACATTGT GTGGWATGCA TTCAGACCCC AGATGTGGGT 300 

GCAGGAAGGC AGGCATGGCA CAGCCAGGTA GAGACTGGTT TCCAGGCCCA AGCAGCCTTC 360 

20 

AGCAGCTGTG CGCCTTGTTT CTGATGTTGT TTGGGAGTAA GAATAATGTA GACATGGGGG 420 

GTCATGARGC TCAATAAAAA CTTCAAGGAA ACCTCCCATG GCATGGTTGG GCGCAGTGAC 480 

25 TCATGCCTGT AACCCCAGCA CTGTGGAATG CCAAGGTGGA AGGATCGCTT GAGGCCAAGA 540 



(2) INFORMATION FOR SEQ ID NO: 89:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1863 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

TCGACCCACG CGTCCGGCGA GATCCCTACC GCAGTAGCCG CCTCTGCCGC CGCQGAGCTT 60

CCCGAACCTC TTCAGCCGCC CGGAGCCGCT CCCGGAGCCC GGCCGTAGAG GCTGCAATCG 120 

CAGCCGGGAG CCCGCAGCCC GCGCCCCGAG CCCGCCGCCG CCCTTCGAGG GCGCCCCAGG 180 

45 

CCGCGCCATG GTGAAGGTGA CGTTCAACTC CGCTCTGGCC CAGAAGGAGG CCAAGAAGGA 240 

CGAGCCCAAG AGCGGCGAGG AGGCGCTCAT CATCCCCCCC GACGCCGTCG CGGTGGACTG 300 

50 CAAGGACCCA GATGATGTGG TACCAGTTGG CCAAAGAAGA GCCTGGTGTT GGTGCATGTG 360 

CTTTGGACTA GCATTTATGC TTGCAGGTTGT TATTCTAGGA GGAGCATACT TGTACAAATA 420 

TTTTGCACTT CAACCAGATG ACGTGTACTA CTGTGGAATA AAGTACATCA AAGATGATGT 480

CATCTTAAAT GAGCCCTCTG CAGATGCCCC AGCTGCTCTC TACCAGACAA TTGAAGAAAA 540 

TATTAAAATC TTTGAAGAAG AAGAAGTTGA ATTTATCAGT GTGCCTGTCC CAGAGTTTGC 600 

AGATAGTGAT CCTGCCAACA TTGTTCATGA CTTTAACAAG AAACTTACAG CCTATTTAGA 660

TCTTAACCTG GATAAGTGCT ATGTGATCCC TCTGAACACT TCCATTGTTA TGCCACCCAG 720 

AAACCTACTG GAGTTACTTA TTAACATCAA GGCTGGAACC TATTTGCCTC AGTCCTATCT 780 

GATTCATGAG CACATGGTTA TTACTGATCG CATTGAAAAC ATTGATCACC TGGGTTTCTT 840 

TATTTATCGA CTGTGTCATG ACAAGGAAAC TTACAAACTG CAACGCAGAG AAACTATTAA 900 

AGGTATTCAG AAACGTGAAG CCAGCAATTG TTTCGCAATT CGGCATTTTG AAAACAAATT 960 

TGCCGTGGAA ACTTTAATTT GTTCTTGAAC AGTCAAGAAA AACATTATTG AGGAAAATTA 1020 

ATATCACAGC ATAACCCCAC CCTTTACATT TTGTGCAGTG ATTATTTTTT AAAGTCTTCT 1080 

TTCATGTAAG TAGCAAACAG GGCITTACTA TCTTTTCATC TCATTAATTC AATTAAAACC 1140 

ATTACCTTAA AATTTTTTTC TTTCGAAGTG TGGTGTCTTT TATATTTGAA TTAGTAACTG 1200 

TATGAAGTCA TAGATAATAG TACATGTCAC CTTAGGTAGT AGGAAGAATT ACAATTTCTT 1260 

TAAATCATTT ATCTGGATTT TTATGTTTTA TTAGCATTTT CAAGAAGACG GATTATCTAG 1320 

AGAATAATCA TATATATGCA TACGTAAAAA TGGACCACAG TGACTTATTT GTAGTTGTTA 1380 

GTTGCCCTGC TACCTAGTTT GTTAGTGCAT TTGAGCACAC ATTTTAATTT TCCTCTAATT 1440 

AAAATGTGCA GTATTTTCAG TGTCAAATAT ATTTAACTAT TTAGAGAATG ATTTCCACCT 1500 

TTATGTTTTA ATATCCTAGG CATCTGCTGT AATAATATTT TAGAAAATGT TTGGAATTTA 1560 

AGAAATAACT TGTGTTACTA ATTTGTATAA CCCATATCTG TGCAATGGAA TATAAATATC 1620 

ACAAAGTTGT TTAACTAGAC TGCGTGTTGT TTTTCCCGTA TAATAAAACC AAAGAATAGT 1680 

TTGGTTCTTC AAATCTTAAG AGAATCCACA TAAAAGAAGA AACTATTTTT TAAAAATTCA 1740 

CTTCTATATA TACAATGAGT AAAATCACAG ATTTTTTCTT TAAATAAAAA TAAGTCAITT 1800 

TAATAACTAA ACCAGATTCT TTGTGATACT ATTAANGTAA CATTTAGCCC CAAAAAAAAA 1860

AAA 1863 




(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GGCACAGCGG CACGAGGTGA GCTGAGCCGG TGGGTGAGCG GCGGCCACGG CATCCTGTGC 60

TGTGGGGGCT ACGAGGAAAG ATCTAATTAT CATGGACCTG CGACAGTTTC TTATGTGCCT 120

GTCCCTGTGC ACAGCCTTTG CCTTGAGCAA ACCCACAGAA AAGAAGGACC GTGTACATCA 180

TGAGCCTCAG CTCAGTGACA AGGTTCACAA TGATGCTCAG AGTTTTGATT ATGACCATGA 240

TGCCTTCTTG GGTGCTGAAG AAGCAAAGAC CTTTGATCAG CTGACACCAG AAGAGAGCAA 300

GGAAAGGCTT GGAAAGATTG TAAGTAAAAT AGATGGCGAC AAGGACGGGT TTGTCACTGT 360

GGATGAGCTC AAAGACTGGA TTAAATTTGC ACAAAAGCGC TGGATTTACG AGGATGTAGA 420

GCGACAGTGG AAGGGGCATG ACCTCAATGA GGACGGCCTC GTTTCCTGGG AGGAGTATAA 480

AAATGCCACC TACGGCTACG TTTTAGATGA TCCAGATCCT GATGATGGAT TTAACTATAA 540

ACAGATGATG GTTAGAGATG AGCGGAGGTT TAAAATGGCA GACAAGGATG GAGACCTCAT 600

TGCCACCAAG GAGGAGTTCA CAGCTTTCCT GCACCCTGAG GAGTATGACT ACATGAAAGA 660

TATAGTAGTA CAGGAAACAA TGGAAGATAT AGATAAGAAT GCTGATGGTT TCATTGATCT 720

AGAAGAGTAT ATTGGTGACA TGTACAGCCA TGATGGGAAT ACTGATGAGC CAGAATGGGT 780

AAAGACAGAG CGAGAGCAGT TTGTTGAGTT TCGGGATAAG AACCGTGATG GGAAGATGGA 840

CAAGGAAGAG ACCAAAGACT GGATCCTTCC CTCAGACTAT GATCATGCAG AGGCAGAAGC 900

CAGGCACCTG GTCTATGAAT CAGACCAAAA CAAGGATGGC AAGCTTACCA AGGAGGAGAT 960

CGTTGACAAG TATGACTTAT TTGTTGGCAG CCAGGCCACA GATTTTGGGG AGGCCTTAGT 1020

ACGGCATGAT GAGTTCTGAG CTRCGGAGGA ACCCTCATTT CCTCAAAAGT AATTTATTTT 1080

TACAGCTTCT GGTTTCACAT GAAATTGTTT GCGCTACTGA GACTGTTACT ACAAACTTTT 1140

TAAGACATGA AAAGGCGTAA TGAAAACCAT CCCGTCCCCA TTCCTCCTCC TCTCTGAGGG 1200

ACTGGAGGGA AGCCGTGCTT CTGAGGAACA ACTCTAATTA GTACACTTGT GTTTGTAGAT 1260

TTACACTTTG TATTATGTAT TAACATGGCG TGTTTATTTT TGTATTTTTC TCTGGTTGGG 1320

AGTATGATAT GAAGGATCAA GATCCTCAAC TCACACATGT AGACAAACAT TAGCTCTTTA 1380

CTCTTTCTCA ACCCCTTTTA TGATTTTAAT AATTCTCACT TAACTAATTT TGTAAGCCTG 1440

AGATCAATAA GAAATGTTCA GGAGAGAGGA AAGAAAAAAA ATATATGCTC CACAATTTAT 1500

ATTTAGAGAG AGAACACTTA GTCTTGCCTG TCAAAAAGTC CAACATTTCA TAGGTAGTAG 1560

GGGCCACATA TTACATTCAG TTGCTATAGG TCCAGCAACT GAACCTGCCA TTACCTGGGC 1620

AAGGAAAGAT CCCTTTGCTC TAGGAAAGCT TGGCCCAAAT TGATITTCTT CTTTTTCCCC 1680

CTGTAGGACT GACTGTTGGC TAATTTTGTC AAGCACAGCT GTGGTGGGAA GAGTTAGGGC 1740

CAGTGTCTTG AAAATCAATC AAGTAGTGAA TGTGATCTCT TTGCAGAGCT ATAGATAGAA 1800

ACAGCTGGAA AACTAAAGGA AAAATACAAG TGTTTTCGGG GCATACATTT TTTTTCTGGG 1860

TGTGCATCTG TTGAAATGCT CAAGACTTAA TTATTTGCCT TTTGAAATCA CTGTAAATGC 1920

CCCCATCCGG TTCCTCTTCT TCCCAGGTGT GCCAAGGAAT TAATCTTGGT TTCACTACAA 1980

TTAAAATTCA CTCCTTTCCA ATCATGTCAT TGAAAGTGCC TTTAACGAAA GAAATGGTCA 2040 

CTGAATGGGA ATTCTCTTAA GAAACCCTGA GATTAAAAAA AGACTATTTG GATAACTTAT 2100 

AGGAAAGCCT AGAACCTCCC AGTAGAGTGG GGATTTTTTT CTTCTTCCCT TTCTCTTTTG 2160 

GACAATAGTT AAATTAGCAG TATTAGTTAT GAGTTTGGTT GCAGTGTTCT TATCTTGTGG 2220 

GCTGATTTCC AAAAACCACA TGCTGCTGAA TTTACCAGGG ATCCTCATAC CTCACAATGC 2280 

AAACCACTTA CTACCAGGCC TTTITCTGTG TCCACTGGAG AGCTTGAGCT CACACTCAAA 2340 

15 GATCAGAGGA CCTACAGAGA GGGCTCTTTG GTTTGAGGAC CATGGCTTAC CTTTCCTGCC 2400 

TTTGACCCAT CACACCCCAT TTCCTCCTCT TTCCCTCTCC CCGCTGCCAA TTCCTGCAGC 2460 

CCGGGGGAAC CACTAGTT 2478 


(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TCGGCCTTGC TTTTGTGGYC TTCCTCTGTG GCCAGAGCGT TTTCATCACC AAGCCTCCTG 60 

ATGGCAGTNC CTTCACCGAT ATGTTCAAGA TACTGACGTA TTCCTGCTGT TCCCAGAAGC 120 

GAAGTGGAGA GCGCCAGAGT AATGGTGAAG GCATTGGAGT NTTTCAGCAA TCTTCTAAAC 180 

AAAGTCTGTT TGATTCATGT AAGATGTCTC ATGGTGGGCC ATTTACAGAA GAGAAAGTGG 240 

AAGATGTGAA AGCTCTGGTC AAGATTGTCC CTGTTTTCTT GGCTTTGATA CCTTACTGGA 300 

CAGTGTATTT CCAAATGCAG ACAACATATG TTTTACAGAG TCTTCATTTG AGGATTCCAG 360 

AAATTTCAAA TATTACAACC ACTCCTCACA CGCTCCCTGC AGCCTGGCTG ACCATGTTTG 420 

ATGCTGTGCT CATCCTCCTG CTCATCCCTC TGAAGGACAA ACTGGTCGAT CCCATTTTGA 480 

GAAGACATGG CCTGCTCCCA TCCTCCCTGA AGAGGATCGC CGTGGGCATG TTCTTTGTCA 540 

TGTGCTCRGC CTTTGCTGCA GGAATTTTGG AGAGTAAAAG GCTGAACCTT GTTAAAGAGA 600 

AAACCATTAA TCAGACCATC GGCAACGTCG TCTACCATGC TGCCGATCTG TCGCTGTGGT 660 

GGCAGGTGCC GCAGTACTTG CTGATTGGGA TCAGCGAGAT CTTTGCAAGT ATCGCAGGCC 720 

TGGAATITGC ATACTCAGCT GCCCCCAAGT CCATGCAGAG TGCCATAATG GGC1TGTTCT 780 

TTTTCTTCTC TGGCGTCGGG TCGTTCGTGG GTTCTGGACT GCTGGCACTG GTGTCTATCA 840

AAGCCATCGG ATGGATGAGC AGTCACACAG ACTTTGGTAA TATTAACGGC TGCTATTTGA 900

ACTATTACTT TTTCCTTCTG GCTGCTATTC AAGGAGCTAC CCTCCTGCTT TTCCTCATTA 960 

5 

TTTCTGTGAA ATATGACCAT CATCGAGACC ATCAGCGATC AAGAGCCAAT GGCGTGCCCA 1020 

CCAGCAGGAG GGCCTGACCT TCCTGAGGCC ATGTGCGGTT TCTGAGGCTG ACATGTCAGT 1080 

10 AACTGACTGG GGTGCACTGA GAACAGGCAA GACTTTAAAT TCCCATAAAA TGTCTGACTT 1140 

CACTGAAACT TGCATGTTGC CTGGATTGAT TTCTTCTTTC CCTCTATCCA AAGGAGCTTG 1200 

GTAAGTGCCT TACTGCAGCG TGTCTCCTGG CACGCTGGGC CCTCCGGGAG GAGAGCTGCA 1260 

15 

GATTTCGAGT ATGTCGCTTG TCATTCAAGG TCTCTGTGAA TCCTCTAGCT GGGTTCCCTT 1320 

TTTTACAGAA ACTCACAAAT GGAGATTGCA AAGTCTTGGG GAACTCCACG TGTTAGTTGG 1380 

20 CATCCCAGTT TCTTAAACAA ATAGTATCAC CTGCTTCCCA TAGCCATATC TCACTGTAAA 1440 

AAAAAAAATT AATAAACTGT TACTTATATT TAAGAAAGTG AGGATTTTTT TTTTTTAAAG 1500 

ATAAAAGCAT GGTCAGATGC TGCAAGGATT TTACATAAAT GCCATATTTA TGGTTTCCTT 1560 

25 

CCTGAGAACA ATCTTGCTCT TGCCATGTTC TTTGATTTAG GCTGGTAGTA AACACATTTC 1620 

ATCTGCTGCT TCAAAAAGTA CTTTACTTTTT AAACCATCAA CATTACTTTT CTTTCTTAAG 1680 

30 GCAAGGCATG CATAAGAGTC ATTTGAGACC ATGTGTCCCA TCTCAAGCCA CAGAGCAACT 1740 

CACGGGGTAC TTCACACCTT ACCTAGTCAG AGTGCTTATA TATAGCTTTA TTTTGGTACG 1800 

ATTGAGACTA AAGACTGATC ATGGTTGTAT GTAAGGAAAA CATTCTTTTG AACAGAAATA 1860 

35 

GTGTAATTAA AAATAATTGA AAGTGTTAAA TGTGAACTTG AGCTGTTTGA CCAGTCACAT 1920 

TTTTGTATTG TTACTGTACG TGTATCTGGG GCTTCTCCGT TTGTTAATAC TTTTTCTGTA 1980 

40 TTTGTTGCTG TATTTTTGGC ATAACTTTAT TATAAAAAGC ATCTCAAATG CGAAAWAAAA 2040 

AAAAAAAAAA AAAAAAAC 2058 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1411 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

GGCACAGGAG CGACCCGGGA GAAGGAGGGC CAMGAKGCGG AAGCGGAGGA GTCTCCAGGA 60 
GACCCGGGGA CAGCATCGCC CAGGCCCCTG TTTGCAGGCC TTTCAGATAT ATCCATCTCA 120 

CAAGACATCC CCGTAGAAGG AGAAATCACC ATTCCTATGA GATCTCGCAT CCGGGAGTTT 180

GACAGCTCCA CATTAAATGA ATCTGTTCGC AATACCATCA TGCGTGATCT AAAAGCTGTT 240 

GGGAAAAAAT TCATGCATGT TTTGTACCCA AGGAAAAGTA ATACTCTTTT GAGAGATTGG 300 

GATTTGTGGG GCCCTTTGAT CCTTTGTGTG ACACTCGCAT TAATGCTGCA AAGAGACTCT 360 

GCAGATAGTG AAAAAGATGG AGGQCCCCAA TTTGCAGAGG TGTTTGTCAT TGTCTGGTTT 420 

GGTGCAGTTA CCATCACCCT CAACTCAAAA CTTCTTGGAG GGAACATATC TTTTTTTCAG 480 

AGCCTCTGTG TGCTGGGTTA CTGTATACTT CCCTTGACAG TAGCAATGCT GATTTGCCGG 540 

15 CTGGTACTTT TGGCTGATCC AGGACCTGTA AACTTCATGG TTCQGCTTTT TGTGGTGATT 600 

GTGATGTTTG CCTGGTCTAT AGTTGCCTCC ACAGCTTTCC TTGCTGATAG CCAGCCTCCA 660 

AACCGCAGAG CCCTAGCTGT TTATCCTGTT TTCCTGTTTT ACTTTGTCAT CAGTTGGATG 720 

20 

ATTCTCACCT TTACTCCTCA GTAAATCAGG AATGGGAAAT TAAAAACCAG TGAATTGAAA 780 

GCACATCTGA AAGATGCAAT TCACCATGGA GCTTTGTCTC TGGCCCTTAT TTGTCTAATT 840 

25 TTGGAGGTAT TTGATAACTG AGTAGGTGAG GAGATTAAAA GGGAGCCATA TAGCACTGTC 900 

ACCCCTTATT TGAGGAACTG ATGTTTGAAA GGCTGTTCTT TTCTCTCTTA ATGTCATTTC 960 

TTTAAAAATA CATGTGCATA CTACACACAG TATATAATGC CTCCTTAAGG CATGATGGAG 1020 

30 

TCACCGTGGT CCATTTGGGT GACAACCAGT GACTTGGGAA GCACATAGAT ACATCTTACA 1080 

AGTTGAATAG AGTTGATAAC TATTTTCAGT TTTGAGAATA CCAGTTCAGG TGCAGCTCTT 1140 

35 AAACACATTG CCTTATGACT ATTAGAATAT GCCTCTCTTT TCATAAATAA AAATACATGG 1200 

TCTATATCCA TTTTCTTTTA TTTCTCTCTC TTAAGCTTAA AAAGGCAATG AGAGAGGTTA 1260 

GGAGTGGGTT CATACACGGA GAATGAGAAA ACATGCATTA ACCAATATTC AGATTTTGAT 1320 

40 

CAGGGGAAAT TCTAYACTTG TTGCAAAAAA AAAAAAAAAA AAACTCGAGG GGGGCCCGGT 1380 

ACCCAATCGC NGTATATGAT CGNAAACAAT C 1411 

45 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2187 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

GCTTTGGCTT TTTTTGGCGG ACTGGGGCGC CCTCCGGAAG CGTTTCCAAC TTTCCAGAAG 60

TTTCTCQGGA CGGGCAGGAG GGGGTGGGGA CTGCCATATA TAGATCCCQG GAGCAGGGGA 120

GOGGGCTAAG AGTAGAATCG TGTCGCGCTC GAGAGCGAGA GTCACGTCCC GGCGCTAGCC 180

CAGCCCGACC CAGGCCCACC GTGGTGCACG CAAACCACTT CCTGGCCATG CGCTCCCTCC 240 

5 

TGCTTCTCAG CGCCTTCTGC CTCCTGGAGG CGGCCCTGGC CGCCGAGGTG AAGAAACCTG 300 

CAGCCGCAGC AGCTCCTGGC ACTGCGGAGA AGTTGAGCCC CAAGGCGGCC ACGCTTGCCG 360 

10 AGCGCAGCCG GCCTGGCCTT CAGCTTGTAC CAGGCCATGG CCAAGGACCA GGCAGTGGAG 420 

AACATCCTGG TGTCACCCGT GGTGGTGGCC TCGTCGCTGG GGCTCGTGTC GCTGGGCGGC 480 

AAGGCGACCA CGGCGTCGCA GGCCAAGGCA GTGCTGAGCG CCGAGCAGCT GCGCGACGAG 540 

15 

GAGGTGCACG CCGGCCTGGG CGAGCTGCTG CGCTCACTCA GCAACTCCAC GGCGCGCAAC 600 

GTGACCTGGA AGCTGGGCAG CCGACTGTAC GGACCCAGCT CAGTGAGCTT CGCTGATGAC 660 

20 TTCGTGCGCA GCAGCAAGCA GCACTACAAC TGCGAGCACT CCAAGATCAA CTTCCGCGAC 720 

AAGCGCAGCG CGCTGCAGTC CATCAACGAG TGGGCCGCGC AGACCACCGA CGGCAAGCTG 780 

CCCGAGGTCA CCAAGGACGT GGAGCGCACG GACGGCGCCC TGTTAGTCAA CGCCATGTTC 840 

25 

TTCAAGCCAC ACTGGGATGA GAAATTCCAC CACAAGATGG TGGACAACCG TGGCTTCATG 900 

GTGACTCGGT CCTATACCGT GGGTGTCATG ATGATGCACC GGACAGGCCT CTACAACTAC 960 

30 TACGACGACG AGAAGGAAAA GCTGCAAATC GTGGAGATGC CCCTGGCCCA CAAGCTCTCC 1020 

AGCCTCATCA TCCTCATGCC CCATCACGTG GAGCCTCTCG AGCGCCTTGA AAAGCTGCTA 1080 

ACCAAAGAGC AGCTGAAGAT CTGGATGGGG AAGATGCAGA AGAAGGCTGT TGCCATCTCC 1140 

35 

TTGCCCAAGG GTGTGGTGGA GGTGACCCAT GACCTGCAGA AACACCTGGC TGGGCTGGGC 1200 

CTGACTGAGG CCATTGACAA GAACAAGGCC GACTTGTCAC GCATGTCAGG CAAGAAGGAC 1260 

40 CTGTACCTGG CCAGCGTGTT CCACGCCACC GCCTTTGAGT TGGACACAGA TGGCAACCCT 1320 

TTGACCAGAA TTACGGGCGG AGGAGTGCGC ACCCAAGTGT TCTACGCCGA CCACCCCTTC 1380 

ATTTCCTAGT GCGGGACACC CAAAGCGGTC CCTGCTATTC ATTGGGCGCC TGGTCCGGCC 1440 

45 

TAAGGGTGAC AAGATGCGAG ACGAGTTATA GGCCTCAGGG TGCACACAGG ATGGCAGGAG 1500 

GCATCCAAAG GCTCCTGAGA CACATGGGTG CTATTGGGGT TGGGGGGGAG GTGAGGTACC 1560 

50 AGCCTTGGAT ACTCCATGGG GTGGGGTGGA AAAGCAGACC GGGGTTCCCG TGTGCCTGAG 1620 

CGGACTTCCC AGCTAGAATT CACTCCACTT GGACATGGGC CCCAGATACC ATGATGCTGA 1680 

GCCCGGAAAC TCCACATCCT GTGGGACCTG GGCCATAGTC ATTCTGCCTG CCCTGAAAGT 1740 

55 

CCCAGATCAA GCCTGCCTCA ATCAGTATTC ATATTTATAG CCAGGTACCT TCTCACCTGT 1800 

GAGACCAAAT TGAGCTAGGG GGGTCAGCCA GCCCTCTTCT GACACTAAAA CACCTCAGCT 1860 

GCCTCCCCAG CTCTATCCCA ACCTCTCCCA ACTATAAAAC TAGGTGCTGC AGCCCCTGGG 1920

ACCAGGCACC CCCAGAATGA CCTGGCCGCA GTGAGGCGGA TTGAGAAGGA GCTCCCAGGA 1980

GGGGCTTCTG GGCAGACTCT GGTCAAGAAG CATCGTGTCT GGCGTTGTGG GGATGAACTT 2040 

5 

TTTGTTTTGT TTCTTCCTTT TTTAGTTCTT CAAAGATAGG GAGGGAAGGG GGAACATGAG 2100 

CCTTTCTTGC TATCAATCCA AGAACTTATT TGTACATTTT TTTTTTCAAT AAAAdTTTC 2160 

10 CAATGACAAA AAAAAAAAAA AAAAAAA 2187 



(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 757 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

GACAGTACGG TCGGATTCCC GGGTCGACCC ACGCGTCCGC GGACGGTGAA GAAGGTGAAG 60

ATGGCGGTGG CCAGGGCCGG GGTCTTGGGA GTCCAGTGGC TGCAAAGGGC ATCCCGGAAC 120 

GTGATGCCGC TGGGCGCACG GACAGCCTCC CACATGACCA AGGACATGTT CCCGGGGCCC 180 

30 

TATCCTAGGA CCCCAGAAGA ACGGGCCGCC GCCGCCAAGA AGTATAATAT GCGTGTGGAA 240 

GACTACGAAC CTTACCCGGA TGATGGCATG GGGTATGGCG ACTACCCGAA GCTCCCTGAC 300 

35 CGCTCACAGC ATGAGAGAGA TCCATGGTAT AGCTGGGACC AGCCGGGCCT GAGGTTGAAC 360 

TGGGGTGAAC CGATGCACTG GCACCTAGAC ATGTACAACA GGAACCGTGT GGATACATCC 420 

CCCACACCTG TTTCTTGGCA TGTCATGTGT ATGCAGCTCT TCGGTTTCCT GGCTTTCATG 480 

40 

ATATTCATGT GCTGGGTGGG GGACGTGTAC CCTGTCTACC AGCCTGTGGG ACCAAAGCAG 540 

TATCCTTACA ATAATCTGTA CCTGGAACGA GGCGGTGATC CCTCCAAAGA ACCAGAGCGG 600 

45 GTGGTTCACT ATGAGATCTG AGGAGGCTTC GTGGGCITTT GGGTCCTCTA ACTAGGACTC 660 

CCTCATTCCT AGAAATTTAA CCTTAATGAA ATCCCTAATA AAACTCAGTG CTGTGTTAAA 720 

AAAAAAAAAA AAAAAAAAAA AAAAAGGGGG GCCCCNN 757 


(2) INFORMATION FOR SEQ ID NO: 95: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2394 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

GGCACGAGCA CTCCTGCACT TCCCCACCCC CACGACCGAA CCTGGCTTCG CTAACGCCCT 60 

5 

CCCAGCTCCC TOGGGCCTGA COTCCGGTTT CCTCGCGCGT CCCTGGCGCC GAGCCGCGGA 120 

CAGCAGCCCC TTTTCCGGCT GAGAGCTCAT CCACACTTCC AATCACTTTC CGGAGTGCTT 180 

10 CCCCTCCCTC CQGCCCGTGC TGGTCCCGAC GGCQGGCCTG GGTCTCGCGC GCGTATTGCT 240 

GGGTAACGGG CCTTCTCYCG CGTCGGCCCG GCCCCTCCTG CCTCGGCTCG TCCCTCCTTC 300 

CAGAACGTCC CGGGCTCCTG CCGAGTCAGA AGAAATGGGA CTCCCTCCGC GACGTGCCCG 360 

15 

GAGCAGCTCC CTTCGCTGTG GAAGCGGCGG TGTCTTCGAA GAAACCGGAA GCCCGTGGTG 420 

ACCCCTGGCG ACCCGGTTTG TTTTCGGTCC GTTTCCAAAC ACTAAGGAAT CGAAACTCGG 480 

20 CGGCCTTGGG GGCGGCCCTA CGTAGCCTGG CTTCTGGTTG TCATGGATGC ACTGGTAGAA 540 

GATGATATCT GTATTCTGAA TCATGAAAAA GCCCATAAGA GAGATACAGT GACTCCAGTT 600 

TCAATATATT CAGGAGATGA ATCTGTTGCT TCCCATTTTG CTCTTGTCAC TGCATATGAA 660 

25 

GACATCAAAA AACGACTTAA GGATTCAGAG AAAGAGAACT CTTTGTTAAA GAAGAGAATA 720 

AGATTTTTGG AAGAAAAGCT AATAGCTCGA TTTGAAGAAG AAACAAGTTC CGTGGGACGA 780 

30 GAACAAGTAA ATAAGGCCTA TCATGCATAT CGAGAGGTTT GCATTGATAG AGATAATTTG 840 

AAGAGCAAAC TGGACAAAAT GAATAAAGAC AACTCTGAAT CTTTGAAAGT ATTGAATGAG 900 

CAGCTACAAT CTAAAGAAGT AGAACTCCTC CAGCTGAGGA CAGAGGTGGA AACTCAGCAG 960 

35 

GTGATGAGGA ATTTAAATCC ACCTTCATCA AACTGGGAGG TGGAAAAGTT GAGCTGTGAC 1020 

CTGAAGATCC ATGGTTTGGA ACAAGAGCTG GAACTGATGA GGAAAGAATG TAGCGATCTC 1080 

40 AAAATAGAAC TACAGAAAGC CAAACAAACG GATCCATATC AGGAAGACAA TCTGAAGAGC 1140 

AGAGATCTCC AAAAACTAAG CATTTCAAGT GATAATATGC AGCATGCATA CTGGGAACTG 1200 

AAGAGAGAAA TGTCTAATTT ACATCTGGTG ACTCAAGTAC AAGCTGAACT ACTAAGAAAA 1260 

45 

CTGAAAACCT CAACTGCAAT CAAGAAAGCC TGTGCCCCTG TAGGATGCAG TGAAGACCTT 1320 

GGAAGAGACA GCACAAAACT GCACTTGATG AATTTTACTG CAACATACAC AAGACATCCC 1380 

50 CCTCTCITAC CAAATGGCAA AGCTCTTTGT CATACCACAT CTTCCCCTTT ACCAGGAGAT 1440 

GTAAAGGTTT TATCAGAGAA AGCAATCCTC CAATCATGGA CAGACAATGA GAGATCCATT 1500 

CCTAATGATG GTACATGCTT TCAGGAACAC AGTTCTTATG GCAGAAATTC TCTGGAAGAC 1560 

55 

AATTCCTGGG TATTTCCAAG TCCTCCTAAA TCAAGTGAGA CAGCATTTGG GGAAACTAAA 1620 

ACTAAAACTT TGCCTTTACC CAACCTTCCA CCACTGCATT ACTTGGATCA ACATAATCAG 1680 

AACTGCCTTT ATAAGAATTA ATTTGGAAGA GATTCACGAT TTCACCATGA GGACACTTAT 1740

CTCTTTCAGT GGTCCTCCCA AGAAATTATT TAACAAACTG AANGGAGATT TTGATTAAAA 1800

TTTTGCAGAG GTCTTCAGTA TCTATA1TTG AACACACTGT ACAATAGTAC AAAAACCAAC 1860

ATAGTTGGTT TTCTAGTATG AAAGAGCACC CTCTAGCTCC ATATTCTAAG AATCTGAAAT 1920

ATGCTACTAT ACTAATTAAT AAGTAAACTT AAGGTGTTTA AAAAACTCTG CCTTCTATAT 1980

TAATTGTAAA ATTTTGCCTC TCAGAAGAAT GGAATTGGAG ATTGTAGACG TGGTTTTACA 2040

AAATGTGAAA TGTCTAAATA TCTGTTCATA AAAATAAAAG GAAAACATGT TTCTTCAAAT 2100

TGCATAATGG AACAAATGGC AATGTGAGTA GGTTACATTT CTGTTGTTAT AATGCGTAAA 2160

GATATTGAAA ATATAATGAA ATAAAAGCAT CTTAGGTTAT ACCATCTTTA TATGCTATTG 2220

CGTTTCAATA TTTAAGATTT AAAGTGATTT TTTGGTCACA GTGTTTTGTT GATAAAATTT 2280

TTTTAGAATT GAAGTTTGAA TTCTAAGACT TGAAACAACC TGATCACTGA AGCCAACTTT 2340

GTCCCAGCAC ATTCCTTAAG TCCTAATTGG GGAAAAAAAA AAAAAAAAAC TCGA 2394



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:
AGTGCTCTGT TGCCCAGGCT GGAGTGCGTT AGTGTAATGT CAGTCCACTG CAACCTCCAC 
CCCCAGGTTC AAGCAATTCT CATGCCTCAG CCTCCCAAGT AGCTGAAATT ACTGGCATGC 
ACCACCACAC CCAGCTGATG TTTATTTATT TATTTATATA TTTATTTATT TTAGGTGTTT 
TTTTTTTTTT TTTTTGAGAC GGAGTCTTGC TCTGTTGCCC TGGGTGTGGT TACGTGGRAT 
TACCATYCTG GGTGACTCAC TGAAATGTAC TCMCAGTGAG TCATGCCTTC MAATGACATC 
TCAAGTTCTG CCTGCTTGGA GATACATCTG GGGATCTTAA GGGGTGAGGG ACTACTCAAC 
AAGAAGGAAT TTAGCCTGTC TTTTTAAATA AACGGCATTT CTTTTTCCTA KAAAAATGGG 
AAATTCTTCA ATTCTCTAAT ACAGGGACAC TGAGATAACA AAGAGGAAAG TGTCTGGTTG 
GAGGTTGGGA RGCCACCCTG GGGTCTCTCC TACAAAAATG GAAAAGAAAA GAACGGTGAR 
AAATCMAGCA AAGCACAARA AAKTTTCCCT TTGCTAAAAG GGAAAAGATG CCCCMCAATG 
CCCATAAACA TGAACTGGGG ATAAGGAGGA RAATGTCTCT YCTTGGCACC CCCAAACAAA 
CGTTAATTAC CC 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

TAAGAACAGA ACAGCAAGTA TGAACCACAT GGAACTTAAA ACATATGGGT GTGAAGTCCA 60 

CTTATGTAGA CAAAACTTAT AATTTCCAAA CTGTTGTCTA GTATACAGTG ATCAGTTGCT 120 

CTCTGTTCAA GTCATTCCAC ACATTTCCCT ATTTTAGGCT ATTATAATAT AGAAAGAAAA 180 

TGGGAAGCAT TAGTTGGAGC TAGAAAATGA ACTGTATATT ATTGCTATAT TTGCTAATAC 240 

CAACTATTTC AATAAGTGTT GTACCATATG TAGCATTAAA TATAAAATAC ATAAAAGAAT 300 

GTACAGAAAA TAGCTTTTAT TGAGTAATAT TACATTTCAT TTATACTGTA GCAATATATT 360 

TGTAGGTATA CTCTGTAAGG GCTTTAAATA AAAGAGGTCC ATTAATACTT CCTTATAAAA 420 

ATTCTAGTCT GTTTCATTAC TGCCCAGATG TTTTAGAGAT AAATATTTAT GCAGAAGGTA 480 

TTTTKGAAAG TCYCCYTTTG TCTGATAGAG TTTAACNAGA TATTTAAATT TAGTGCYCNA 540 

GAAATCCCAC AAGTCACGGT CTAAACACAC 1TAGAATACT ACAGCATAAA TCTGTTAGCA 600 

TTANTTGCCA AATAAGACAG TTGGGATCCC AAACCCCAAG TCCTTGAGCA ATGTTTTTCC 660 

TCAAAAAGCT GCTATNCCAA TGATATAGGA AAAWACATTG TGTTTTCCTA AACACACTTT 720 

TCTTTTTAAA TGTGCTTCAT TGTTTGATTT GGTCCTGCCT AAATTTCACA AGCTAGGCCA 780 

ATGAAGGCTG AATCAAAGAC ATTTCATCCA CCAATATCAT GTGTAGATAT TATGTATAGA 840 

AAATAAAATA AATTATGGCT CTAACTTCTG TGTTGCTGTT TATCTTGTTA TTTTTCGGCG 900 

TTATACTAAT GNGTTTATTG AGAGCATTTT ACCTTCCAGA CTTCTCATGG CTAACTTTTG 960 

GTCTGWATTT TGSTCCTTAG ATGKGAATAT TTCTTAITAG TYTGCTYCCT GCWACGCAAT 1020 

GACTGCATTT CTATCATTTC TCAGTTTGTT AGWATATGTG GATAGTATTC TACTGTATAA 1080 

ATGATTGCAA AGTTTATCAA AAACAAATTA TTATATGTAG CTTTTCTACA GTGCTTTGCT 1140 

AAACCATGTA GTACTAGTTA AGTSTTCCTT GAAAATAAAG ATACACTCTT ATAGGGGACA 1200 

GTTCCTGTTC ACTCCCAGGA AACTTTTTTA AAAGATGACA CTGAATGTTT ATTGCACTTT 1260 

AGTGCAGTGA AGTGGCAATA AAACCTAACA TGAATCAAGG TTGTTTATGG CAGATGCATG 1320 

TGTTGCTTTA CAGAGTTTAG CAAAAGCTCT TAATTTTATG TCATACTGTA TTCTACTGAA 1380 

TAATAAAGCT AACATTATTC AATAATAAAA TGGAAAAAA 1419 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1487 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

GCGACCGCGC CCCTTTCAGC TAGCTCGCTC GCTCGCTCTG CTTCCCTGCT GCCGGCTGCG 60 

CATGGCKWTG GOGTTQGCGG CGCTGGCGGC GGTCGAGCCG GCCTGCGCAG CCGGTACCAG 120 

CAGTTGCAGA ATGAAGAAGA GTCTGGAGAA CCTGAACAGG CTGCAGGTGA TGCTCCTCCA 180 

CCTTACAGCA GCATTTCTGC AGAGAGCGCA GTTTTCCACC TATTTCCCTG GATATTTTGA 240 

TGGTCAGTAC TGGCTCTGGT GGGTGTTCCT TGTTTTAQGC TTTCTCCTGT TTCTCAGAGG 300 

ATTTATCAAT TATGCAAAAG TTCQGAAGAT GCCAGAAACT TTCTCAAATC TCCCCAGGAC 360 

CAGAGTTCTC TTTATTTATT AAAGATGTTT TCTGGCAAAG GCCTTCCTGC ATTTATGAAT 420 

TCTCTCTCAA GAAGCAAGAG AACACCTGCA GGAAGTGAAT CAAGATGCAG AACACAGAGG 480 

AATAATCACC TGCTTTAAAA AAATAAAGTA CTGTTGAAAA GATCATTTCT CTCTATTTGT 540 

TCCTAGGTGT AAAATTTTAA TAGTTAATGC AGAATTCTGT AATCATTGAA TCATTAGTGG 600 

TTAATGTTTG AAAAAGCTCT TGCAATCAAG TCTGTGATGT ATTAATAATG CCTTATATAT 660 

TGTTTGTAGT CATTTTAAGT AGCATGAGCC ATGTCCCTGT AGTCGGTAGG GGGCAGTCTT 720 

GCTTTATTCA TCCTCCATCT CAAAATGAAC TTGGAATTAA ATATTGTAAG ATATGTATAA 780 

TGCTGGCCAT TTTAAAGGGG TTTTCTCAAA AGTTAAACTT TTGTTATGAC TGTGTTTTTG 840 

CACATAATCC ATATTTGCTG TTCAAGTTAA TCTAGAAATT TATTCAATTC TGTATGAACA 900 

CCTGGAAGCA AAATCATAGT GCAAAAATAC ATTTAAGGTG TGGTCAAAAA TAAGTCTTTA 960 

ATTGGTAAAT AATAAGCATT AATTTTTTAT AGCCTGTATT CACAATTCTG CGGTACCTTA 1020 

TTGTACCTAA GGGATTCTAA AGGTGTTGTC ACTGTATAAA ACAGAAAGCA CTAGGATACA 1080 

AATGAAGCTT AATTACTAAA ATGTAATTCT TGACACTCTT TCTATAATTA GCGTTCTTCA 1140 

CCCCCACCCC CACCCCCACC CCCCTTATTT TCCTTTTGTC TCCTGGTGAT TAGGCCAAAG 1200 

TCTGGGAGTA AGGAGAGGAT TAGGTACTTA GGAGCAAAGA AAGAAGTAGC TTGGAACTTT 1260 

TGAGATGATC CCTAACATAC TGTACTACTT GCTTTTACAA TGTGTTAGCA GAAACCAGTG 1320 

GGTTATAATG TAGAATGATG TGCTTTCTGC CCAAGTGGTA ATTCATCTTG GTTTGCTATG 1380 

TTAAAACTGT AAATACAACA GAACATTAAT AAATATCTCT TGTGTAGCAC CTTTAAAAAA 1440 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAN CCCGGGGGGG GGCCCCN 1487 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

GCGACCGCGC CCTTCAGCTA GCTCGCTCGC TCGCTCTGCT TCCCTGCTGC CGGCTGCGCA 60 

TGGCTTNGGC GTTGGCGGCG CTGGCGGCGG CTCGAGCCGC CTGCGSAGCC GGTACCAGCA 120 

GTTGCAGAAT GAAGAAGAGT CTGGAGAACC TGAACAGGCT GCAGGTGATG CTCCTCCACC 180 

TTACAGCAGC ATTTCTGCAG AGAGCGCACA TNATTTTGAC TACAAGGATG AGTCTGGGTT 240 

TCCAAAGCCC CCATCTTACA ATGTAGCTAC AACACTGCCC AGTTATGATG AAGCGGAGAG 300 

GACCAAGGCT GAAGCTACTA TCCCTTTGGT TCCTGGGAGA GATGAGGATT TTGTGGGTCG 360 

GGATGATTTT GATGATGCTG ACCAGCTGAG GATAGGAAAT GATGGGATTT TCATGTTAAC 420 

TTTTTTCATG GCATTCCTCT TTAACTGGAT TGGGTTTTTC CTGTCTTTTT GCCTGACCAC 480 

TTCAGCTGCA GGAAGGTATG GGGCCA1TTC AGGATTTGGT CTCTCTCTAA TTAAATGGAT 540 

CCTGATTGTC AGGTCTTCCA CCTATTTCCC TGCATTTATG AATTCTCTCT CAAGAAGCAA 600 

GAGAACACCT GCAGGAAGTG AATCAAGATG CAGAACACAG AGGAATAATC ACCTGCTTTA 660 

AAAAAATAAA GTACTGTTGA AAAGATCATT TCTCTCTATT TGTTCCTAGG TGTAAAATTT 720 

TAATAGTTAA TGCAGAATTC TGTAATCATT GAATCATTAG TGGTTAATGT TTGAAAAAGC 780 

TCTTGCAATC AAGTCTGTGA TGTATTAATA ATGCCTTATA TATTGTTTGT AGTCATTTTA 840 

AGTAGCATGA GCCATGTCCC TGTAGTCGGT AGGGGGCAGT CTTGCTTTAT TCATCCTCCA 900 

TCTCAAAATG AACTTGGAAT TAAATATTGT AAGATATGTA TAATGCTGGC CATTTTAAAG 960 

GGGTTTTCTC AAAAGTTAAA CTTTTGTTAT GACTGTGTTT TTGCACATAA TCCATATTTG 1020 

CTGTTCAAGT TAATCTAGAA ATTTATTCAA TTCTGTATGA ACACCTGGAA GCAAAATCAT 1080 

AGTGCAAAAA TACATTTAAG GTGTGGTCAA AAATAAGTCT TTAATTGGTA AATAATAAGC 1140 

ATTAATTTTT TATAGCCTGT ATTCACAATT CTGCGGTACC TTATTGTACC TAAQGGATTC 1200 

TAAAGGTGTT GTCACTGTAT AAAACAGAAA GCACTAGGAT ACAAATGAAG CTTAATTACT 1260 

AAAATGTAAT TCTTGACACT CTTTCTATAA TTAGCGTTCT TCACCCCCAC CCCCACCCCC 1320 
ACCCCCCTTA TTTTCCTTTT GTCTCCTGGT GATTAGGCCA AAGTCTGGGA GTAAGGAGAG 
GATTAGGTAC TTAGGAGCAA AGAAAGAAGT AGCTTGGAAC TTTTGAGATG ATCCCTAACA 
TACTGTACTA CTTGCTTTTA CAATGTGTTA GCAGAAACCA GTGGGTTATA ATGTAGAATG 
ATGTGCTTTC TGCCCAAGTG GTAATTCATC TTGGTTTGCT ATGTTAAAAC TGTAAATACA 
ACAGAACATT AATAAATATC TCTTGTGTAG CACCTTTTAW AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAANCCCG GGGGGGGGCC CCN 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
TTTTTTTTTT TTTTTTTTTT TTGACTGAAC TAAGTQGCTT TTTTATTAGA GAAAGCCAGA 
ATTACAAAAG ACTTCCCTTT TCTTGGGGTA TGGCTGTCTC AGCACAATAC TCAACATAAC 
TGCAGAACTG ATGTGGCTCA GGCACCCTGG TTTTAATTCC TTGAGGATCT GGCAATTGGC 
TTACGCAAAA GGTCACCATT TGAGGTCCTG CCTTACTAAT TATGTGCTGC CCAACAACTA 
AATTTGTAAT TTGTTTTTCT CTAGTTTGAG CAGGGTCTGA ATTTTTTCAT TTATTTCCTT 
TTTTGCCAGC AGACAGACTT GAGTCTGTAA AGACAAGCAA ATACACTGAC AGAAGTTTAC 
CATAGTTTCT AAAATGTAAA AAAGAAAACC CCCAAAAGAC TCAAGAAAAT TAGACCACAA 
ATTTTGCATT GTTCATTGTA GCACTATTGG TAATAAAATA ACAAATGTTT GTGCATTTTT 
ATGTGAAGAT CCTTCTCGTA TTTCATTTGG AAAGATGAGC AAGAGGTCTG CTTCCTTCAT 
TTTACTTCCC CrTCTCTTTT TGAAAGQCAG TTTCGCCAAG CTTAATGCAA GAATATCTGA 
CTGTTTAGAA GAAAGATATT GCCACAATCT CTGGATGGTT TTCCAGGGTT GTGTTATTAC 
TGAGCTTCAT CTTTCCAGAA TGAGCAAAAC ACTGTCCAGT CTTTGTTACG ATTTTGTAAT 
AAATGTGTAC ATTTTTTTTA AATTTTTGGA CATCACATGA ATAAAGGTAT GTATGTACGA 
ATGTGTATAT ATTATATATA TGACATCTAT TTTGGAAAAT GTTTGCCCTG CTGTACCTCA 
TTTTTAGGAG GTGTGCATGG ATGCAATATA TGAAAATGGG ACATTCTGGA ACTGCTGGTC 
AGGGGACTTT GTCGCCCTGT GCACTAAAAG GGCCAGATTT TCAGCAGCCA AGGACATCCA 
TACCCAAGTG AATGTGATGG GACTTAAAAG AAGTGAACTG AGACAATTCA CTCTGGCTGT 
TTGAACAGCA GCGTTTCATA GGAAGAGAAA AAAAGATCAA TCTTGTATTT TCTGACCACA 
TAAAGGCTTC TTCTCTTTGT AATAAAGTAG AAAAGCTCTC CTCAAAAAAA AAAAAAAAAA 
AAAAA 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TACCCGGCGG ATTCCAGGAA GGTAAATTTA GTCCTATAAT TTTCAGCTTA ATTATAAACA 
AAGGAACAAA TAAGTGGAAG GGCAGCTATT ACCATTCGCT TAGTCAAAAC ATTCGGTTAC 
TGCCCTTTAA TACACTCCTA TCATCAGCAC TTCCACCATG TATTACAAGT CTTGACCCAT 
CCCTGTCGTA ACTCCAGTAA AAGTTACTGT TACTAGAAAA TTTTTATCAA TTAACTGACA 
AATAGTTTCT TTTTAAAGTA GTTTCTTCCA TCTTTATTCT GACTAGCTTC CAAAATGTGT 
TCCCTTTTTG AATCGAGGTT TTTTTGTTTT GTTTTGTTTT CTGAAAAAAT CATACAACTT 
TGTGCTTCTA TTGCTTTTTT GTGTTTTGTT AAGCATGTCC CTTGGCCCAA ATGGAAGAGG 
AAATGTTTAA TTAATGCTTT TTAGTTTAAA TAAATTGAAT CATTTATAAT AATCAGTGTT 
AACAATTTAG TGACCCTTGG TAGGTTAAAG GTTGCATTAT 1TATACTTGA GATTTTTTTC 
CCCTAACTAT TCTGTTTTTT GTACTTTAAA ACTATGGGGG AAATATCACT GGTCTGTCAA 
GAAACAGCAG TAATTATTAC TGAGTTAAAT TGAAAAGTCC AGTGGACCAG GCATTTCTTA 
TATAAATAAA ATTGGTGGTA CTAATGTGAA AAAAAAAAAA AAAAAAAACT CGAGGGQGGC 
CCGGTACCCT ATTA 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CCGCGGGAAC GCTGTCCTGG CTGCCGNCAC CCGAACAGCC TGTCCTGGTG CCCCGGCTCC 
CTGCCCCGCG CCCAGTCATG ACCCTGCGCC CCTCACTCCT CCCGCTCCAT CTGCTGCTGC 
TGCTGCTGCT CAGTGCGGCG GTGTGCCGGG CTGAGGCTGG GCTCGAAACC GAAAGTCCCG 
TCCGGACCCT CCAAGTGGAG ACCCTGGTGG AGCCCCCAGA ACCATGTGCC GAGCCCGCTG 
CTTTTGGAGA CACGCTTCAC ATACACTACA CGGGAAGCTT GGTAGATGGA CGTATTATTG 
ACACCTCCCT GACCAGAGAC CCTCTGGTTA TAGAACTTGG CCAAAAGCAG GTGATTCCAG 
GTCTGGAGCA GAGTCTTCTC GACATGTGTG TGGGAGAGAA GCGAAGGGCA ATCATTCCTT 
CTCACTTGGC CTATGGAAAA CGGGGATTTC CACCATCTGT CCCAGCGGAT GCAGTGGTGC 
AGTATGACGT GGAGCTGATT GCACTAATCC GAGCCAACTA CTGGCTAAAG CTQGTGAAGG 
GCATTTTGCC TCTQGTAGGG ATGGCCATGG TGCCACCCTC CTGGGCCTCA TTGGGTATCA 
CCTATACAGA AAGGCCAATA GACCCAAAGT CTCCAAAAAG AAGCTCAAGG AAGAGAAACG 
AAACAAGAGC AAAAAGAAAT AATAAATAAT AAATTTTAAA AAACTTAAAA AAA 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CCGATGTGGA CATCATCCTG TCTATCCCCA TGTTCCTGCG CCTGTACCTG ATCGCCCGAG 
TCATGCTGCT GCACAGAAGC TCTTCACCGA TGCCTCGTCC CGCAGCATCG GGGCCCTCAA 
CAAGATCAAC TTCAACACCC GCTTTGTCAT GAAGACGCTC ATGACCATCT GCCCTGGCAC 
TGTGCTGCTC GTGTTCAGCA TCTCTCTGTG GATCATTGCT GCCTGGACCG TCCGTGTCTG 
TGAAAGTCCT GAATCACCAG CCCAGCCTTC TGGCTCATCA CTTCCTGCTT GGTACCATGA 
CCAGCAGGAC GTAACTAGTA ACTTTCTGGG TGCCATGTGG CTCATCTCCA TCACATTCCT 
TTCCATTGGT TATGQGGACA TGGTGCCCCA CACATACTGT GGGAAAGGTG TCTGTCTCCT 
CACTGGCATC ATGGGTGCAG GCTGCACTGC CCTTGTGGTG GCCGTGGTGG CCCGAAAGCT 
GGAACTCACC AAAGCGGAGA AGCACGTTCA TAANTTCATG ATGGACACTC AGCTCACCAA 
GCGGATCAAG AATGYTGCAG CCAATGTCCT TSGGGAAACA TGGTTAATCT ATAAACACAC 
AAAGYTGYTA AAGAAGATTG ACCATGCCAA AGTGAGGAAC ACCAGAGGAA GTTCYTCCAA 
GTATCCACCA GTTGAGGAGC GTCAAGATGG AACAGAGGAA GCTGAGTGAC CAAGCCAACA 
NTCTGGTGGA CCTTTCCAAG ATGCAGAATG TCMTGTATGA CTTAATCACA GAACTCAATG 
ACCGGAGCGA AGACCTGGAG AAGCAGATTG GCAGCCTGGA GTCGAAGCTG GAGCATCTCA 840 

CCGCCAGCTT CAACTCCCTG CCGCTGCTCA TCGCCGACAC CCTGCGCCAG CAGCAGCAGC 900 

AGCTCCTGTC TGCCATCATC GAGGCCCGGG GTGTCAGCGT GGCAGTGGGC ACCACCCACA 960 

CCCCAATCTC CGATAGCCCC ATTGGGGTCA GCTCCACCTC CTTCCCGACC CCGTACACAA 1020 

GTTCAAGCAG TTGCTAAATA AATCTCCCCA CTCCAGAAGC ATTAAAAAAA AAAAAAAAAA 1080 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GGCACGAGAG GCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAGGT CACCACCATG 60 

AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC TGCCCAGAAT 120 

CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA TGATGAAGCC 180 

CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC TACCACTGCA 240 

ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA ATGGGTTGGG 300 

GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA GCTTGAGTCT TCTGCAATTG 360 

GTCACAACTA TTCATGCTTC CTGTGATTTC ATCCAACTAC TTACCTTGCC TACGATATCC 420 

CCTTTATCTC TAATCAGTTT ATTTTCTTTC AAATAAAAAA TAACTATGAG CAACAAAAAA 480 

AAAAAAAAA 489 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

GCGGTCGCCG CTGTTGTTGT GGTCCCCATG GAGCTGCCGT AGCGGACCCA GCACAGCCAG 60 
GAGCGTCCGG GATGAGCTCA GCCGCGGCCG ACCACTGGGC GTGGTTGCTG GTGCTCAGCT 120 
TCGTGTTTGG ATGCAATGTT CTTAGGATCC TCCTCCCGTC CTTCTCATCC TTCATGTCCA 180 
GGGTGCTGCA GAAGGACGCG GAGCAGGAGT CACAGATGAG AGCGGAGATC CAGGACATGA 240 

AGCAQGAGCT CTCCACAGTC AACATGATGG ACGAGTTTGC CAGATATGCC AGGCTGGAAA 300 

GAAAGATCAA CAAGATGACG GATAAGCTCA AAACCCATGT GAAAGCTCGG ACAGCTCAAT 360 

TAGCCAAGAT AAAATGGGTG ATAAGTGTCG CTTTCTACGT ATTGCAGGCT GCCCTGATGA 420 

TCTCACTCAT TTGGAAGTAT TATTCTGTCC CTGTGGCTGT CGTGCCGAGT AAATGGATAA 480 

CCCTYTAGAC CGCCTGGTAG CCTTTCCYAY TAGAGTAGCA GGTGGTGTTG GAATTACTGT 540 

TGGATTTART CTGTACAAAT TGTCCTATTG TGdTCACCG TYCASTGAAC AGGAGGTGGT 600 

ACAGCCGGAG TTAAAAACGG TTTCCNTTCC AGTTTAAAAT 640 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

GGGCACNAGA TGGAGCTGCC GTAGCGGACC CAGCACAGCC AGGAGCGTCC GGGATGAGCT 60 

CAGCCGCGGC CGACCACTGG GCGTGGTTGC TGGTGCTCAG CTTCGTGTTT GGATGCAATG 120 

TTCTTAGGAT CCTCCTCCCG TCCTTCTCAT CCTTCATGTC CAGGGTGCTG CAGAAGGACG 180 

CGGACAGGAG TCACAGATGA GAGCGGAGAT CCAGGACATG AAGCAGGAGC TCTCCACAGT 240 

CAACATGATG GACGAGTTTG CCAGATATGC CAGGCTGGAA AGAAAGATCA ACAAGATGAC 300 

GGATAAGCTC AAAACCCATG TGAAAGCTCG GACAGCTCAA TTAGCCAAGA TAAAATGGGT 360 

GATAAGTGTC GCTTTCTACG TATTGCAGGC TGCCCTGATG ATCTCACTCA TTTGGAAGTA 420 

TTATTCTGTC CCTGTGGCTG TCGTGCCGAG TAAATGGATA ACCCCTCTAG ACCGCCTGGT 480 

AGCCTTTCCT ACTAGAGTAG CAGGTGGTGT TGGAATTACC TGTTGGATTT TAGTCTGTAA 540 

CAAAGTTGTC GCTATTGTGC TTCATCCGTT CAGCTGAACA GGAGGATGGA TACAGCCGCG 600 

AGTAAAAAAA CGGATTTCCT CTTCCTAGCT TAAAATCTGA TTTACACTGT TTTGTTTTTT 660 

AAGAAACAAA AGTGCATAGT TTAGATTTTT TTTTTGTTGA ATATGTTTGT TCTTGGACTT 720 

TATGAGATAG TCTTATAAGA ATCACGATTT TCTACACCTG TCATTGAGCC AAGAAAGTCC 780 

AGTTTATGAC ACGTATGTAC TAGTGAACAC CGTCCTCGAT CTGTACGAAA TGTGAAATGT 840 

TTAGGGACAT CTCCATGCTG TCACTTGTGA TTTGCCCTCT TATGTATTTT GGTCATATTG 900 
CCAACTGGAA AGTCAAAATT TTCTAACAAC TTTAAGTAAG TTCTTTGAAG ACTTAGTGCT 960 

GTTTTTAATC CAGTTTAGAA AGTAACTTAA TTTTAATACC RCTACTAAAA ATTCGAAAAT 1020 

TTCTTCTTTA ATCACATTCA ATATGGTTAA AAGAACAACA CTAATTGACA TTGCGTGGGC 1080 

TTTTTCTCCC TTTGTTTAAA ATGTCATTTG TTGAGCAAGA GTTGTATAGT ATTATCTACT 1140 

TACTTGAGGC TGTTAATTTT TCATTACAGT GTTTTGTAAA TGTATCCACG AGACCATGAT 1200 

GCATTGTTTT GTGCTCAACT TGTGTTTTGT ATTTAAAGCA TCTTGAATGA AGTGTATTTT 1260 

ATAAGCATTT AATATTTATG CTCTTTAGAA TGGAACACAG AAAACAAACC TTATAAGTCC 1320 

TGATTAATCT GAACCAATAA CCTGTGTGGC CTACAAAGTA TAATTCTATT AAATGTTCCT 1380 

TAAAACACTT TTTTCTAATT AAAATCTTTG CAAATGCTTG TGTAACTTCC TGCCTTACAG 1440 

CTACTTGTTT GCTGTGAGCC ACCCGCAACT GACAAGTGGC TGTTAACTGA GTCACCATAT 1500 

CCCAGTAAAG CTGAATTTTC TCACTAAAA 1529 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2435 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

ATGAAGGGTC GTTGGTGGGA AAGATGGCGG CGACTCTGGG ACCCCTTGGT CGTGGCAGCA 60 

GTGGCGRCGA TGTTTGTCGG CTCGGGATGG GTCCAGGATG TTACTCCTTC TTCTTTTGTT 120 

GGGGTCTGGG CAGGGGCCAC AGCAAGTCGG GGCGGGTCAA ACGTTCGAGT ACTTGAAAOG 180 

GGAGCACTCG CTGTCGAAGC CCTACCAGGG TGTGGGCACA GGCAGTTCCT CACTGTGGAA 240 

TCTGATGGGC AATGCCATGG TGATGACCCA GTATATCCGC CTTACCCCAG ATATGCAAAG 300 

TAAACAGGGT GCCTTGTGGA ACCGGGTGCC ATGTTTCCTG AGAGACTGGG AGTTGCAGGT 360 

GCACTTCAAA ATCCATGGAC AAGGAAAGAA GAATCTGCAT QGGGATGGCT TGGCAATCTG 420 

GTACACAAAG GRWTCGGATG CAGCCAGGGC CTGTNTTTGG GAAACATGGA CAAATTTGTG 480 

GGGCTGGGAG TATTTGTAGA CACCTACCCC AATGAGGAGA AGCAGCAAGA GCGGGTATTC 540 

CCOTRCMTCT CAGCCATGGT GAACAACGGC TCCCTCAGCT ATGATCATGA GCGGGATGGG 600 

CGGCCTACAG AGCTGGGAGG CTGCASAGCC ATTGTCCGCA ATCTTCATTA CGACACCTTC 660 

CTGGTGATTC GCTACGTCAA GAGGCATTTR ACGATAATGA TGGATATTGA TGGCAAGCAT 720 

GAGTGGAGGG ACTGCATTGA AGTGCCCGGA GTCCGCCTGC CCCGCGGCTA CTACTTCGGC 780 

ACCTCCTCCA TCACTGGGGA TCTCTCAGAT AATCATGATG TCATTTCCTT GAAGTTGTTT 840 

GAACTGACAG TGGAGAGAAC CCCAGAAGAG GAAAAGCTCC ATCGAGATGT GTTCTTGCCC 900 

TCAGTGGACA ATATGAAGCT GCCTGAGATG ACAGCTCCAC TGCCGCCCCT GAGTGGCCTG 960 

GCCCTCTTCC TCATCGTCTT TTTCTCCCTG GGTGTTTTCT GTATTTGCCA TAGTCATTGG 1020 

TATCATACTC TACAACAAAT GGCAGGAACA GAGCCGAAAG CGCTTCTACT GAGCCCTCCT 1080 

GCTGCCACCA CTTTTGTGAC TGTCACCCAT GAGGTATGGA AGGAGCAGGC ACTGGCCTGA 1140 

GCATGCAGCC TGGAGAGTGT TCTTGTCTCT AGCAGCTGGT TGGGGACTAT ATTCTGTCAC 1200 

TGGAGTTTTG AATGCAGGGA CCCCGCATTC CCATGGTTGT GCATGGGGAC ATCTAACTCT 1260 

GGTCTGGGAA GCCACCCACC CCAGGGCAAT GCTGCTGTGA TGTGCCTTTC CCTGCAGTCC 1320 

TTCCATGTGG GAGCAGAGGT GTGAAGAGAA TTTACGTGGT TGTGATGCCA AAATCACAGA 1380 

ACAGAATTTC ATAGCCCAGG CTGCCGTGTT GTTTGACTCA GAAGGCCCTT CTACTTCAGT 1440 

TTTGAATCCA CAAAGAATTA AAAACTGGTA ACACCACAGG CTTTCTGACC ATCCATTCGT 1500 

TGGGTTTTGC ATTTGACCCA ACCCTCTGCC TACCTGAGGA GCTTTCTTTG GAAACCAGGA 1560 

TGGAAACTTC TTCCCTGCCT TACCTTCCTT TCACTCCATT CATTGTCCTC TCTGTGTGCA 1620 

ACCTGAGCTG GGAAAGGCAT TTGGATGCCT CTCTGTTGGG GCCTGGGGCT GCAGAACACA 1680 

CCTGCGTTTC ACTGGCCTTC ATTAGGTGGC CCTAGGGAGA TGGCTTTCTG CTTTGGATCA 1740 

CTGTTCCCTA GCATGGGTCT TGGGTCTATT GGCATGTCCA TGGCCTTCCC AATCAAGTCT 1800 

CTTCAGGCCC TCAGTGAAGT TTGGCTAAAG GTTGGTGTAA AAATCAAGAG AAGCCTGGAA 1860 

GACATCATGG ATGCCATGGA TTAGCTGTGC AACTGACCAG CTCCAGGTTT GATCAAACCA 1920 

AAAGCAACAT TTGTCATGTG GTCTGACCAT GTGGAGATGT TTCTGGACTT GCTAGAGCCT 1980 

GCTTAGCTGC ATGTTTTGTA GTTACGATTT TTGGAATCCC ACTTTGAGTG CTGAAAGTGT 2040 

AAGGAAGCTT TCTTCTTACA CCTTGGGCTT GGATATTGCC CAGAGAAGAA ATTTGGCTTT 2100 

TTTTTTNCTT AATGGACAAG AGACAGTTGC TGTTCTCATG TTCCAAGTCT GAGAGCAACA 2160 

GACCCTCATC ATCTGTGCCT GGAAGAGTTC ACTGTCATTG AGCAGCACAG CCTGAGTGCT 2220 

GGCCTCTGTC AACCCTTATT CCACTGCCTT ATTTGACAAG GGGTTACATG CTGCTCACCT 2280 

TACTGCCCTG GGATTAAATC AGTTACAGGC CAGAGTCTCC TTGGAGGGCC TGGAACTCTC 2340 

AGTCCTCCTA TGAACCTCTG TAGCCTAAAT GAAATTCTTA AAATCACCGA TGGAACCAAA 2400 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAN 2435 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

ATGAAACTTA AGAATTGAAT TGGAAAGACT TCTCAAAGAG AATTGTATGT AACGATGTTG 60 

TATTGATTTT TAAGAAAGTA ATTTAA0TTG TAAAACTTCT GCTCGTTTAC ACTGCACATT 120 

GAATACAGGT AACTAATTGG AAGGAGAGGG GAGGTCACTC TTTTGATGGT GGCCCTGAAC 180 

CTCATTCTGG TTCCCTGCTG CGCTGCTTGG TGTGACCCAC GGAGGATCCA CTCCCAGGAT 240 

GACGTGCTCC GTAGCTCTGC TGCTGATACT GGGTCTGCGA TGCAGCGGCG TGAGGCCTGG 300 

GCTGGTTGGA GAAGGTCACA ACCCTTCTCT GTTGGTCTGC CTTCTGCTGA AAGACTCGAG 360 

AACCAACCAG GGAAGCTGTC CTGGAQGTCC CTGGTCGGAG AGGGACATAG AATCTGTGAC 420 

CTCTGACAAC TGTGAAGCCA CCCTGGGCTA CAGAAACCAC AGTCTTCCCA GCAATTATTA 480 

CAATTCTTGA ATTCCTTGGG GATTTTTTAC TGCCCTTTCA AAGCACTTAA GTGTTAGATC 540 

TAACGTGTTC CAGTGTCTGT CTGAGGTGAC TTAAAAAATC AGAACAAAAC TTCTATTATC 600 

CAGAGTCATG GGAGAGTACA CCCTTTCCAG GAATAATGTT TTGGGAAACA CTGAAATGAA 660 

ATCTTCCCAG TATTATAAAT TGTGTATTTA AAAAAAAGAA ACTTTTCTGA ATGCCTACTG 720 

GCGGTGTATA CCAGGCAGTG TGCCAGTTTA AAAAGATGAA AAAGAATAAA AACTTTTGAG 780 

GAACAAAAAA AAAAAAAAAA AAATT 805 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1166 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GGCACGAGAG GCGCCAGTCG CAGGTGTGCT GCTGAGGCGT GAGAATGGCG TCCCGCGGCC 60 

GGCGTCCGGA GCATGGCGGA CCCCCAGAGC TGTTTTATGA CGAGACAGAA GCCCGGAAAT 120 

ACGTTCGCAA CTCACGGATG ATTGATATCC AGACCAGGAT GGCTGGGCGA GCATTGGAGC 180 

TTCTTTATCT GCCAGAGAAT AAGCCCTGTT ACCTGCTGGA TATTGGCTGT GGCACTQGGC 240 

TGAGTGGAAG TTATCTGTCA GATGAAGGGC ACTATTGGGT GGGCCTGGAT ATCAGCCCTG 300 

CCATGCTGGA TGAGGCTGTG GACCGAGAGA TAGAGGGAGA CCTGCTGCTG GGGGATATGG 360 

GCCAGGGCAT CCCATTCAAG CCAGGCACAT TTGATGGTTG CATCAGCATT TCTGCTGTGC 420 

AGTGGCTCTG TAATGCTAAC AAGAAGTCTG AAAACCCTGC CAAGCGCCTG TACTGCTTTT 480 

TTGCTTCTCT TTTTTCTGTT CTCGTCCGGG GATCCCGAGC TGTCCTGCAG CTGTACCCTG 540 

AGAACTCAGA GCAGTTGGAG CTGATCACAA CCCAGGCCAC AAAGGCAGGC TTCTCCGGTG 600 

GCATGGTGGT AGACTACCCT AACAGTGCCA AAGCAAAGAA ATTCTACCTC TGCTTGTTTT 660 

CTGGGCCTTC GACCTTTATA CCAGAGGGGC TGAGTGAAAA TCAGGATGAA GTTGAACCCA 720 

GGGAGTCTGT GTTCACCAAT GAGAGGTTCC CATTAAGGAT GTCGAGGCGG GGAATGGTGA 780 

GGAAGAGTCG GGCATGGGTG CTGGAGAAGA AGGAGCGGCA CAGGCGCCAG GGCAGGGAAG 840 

TCAGACCTGA CACCCAGTAC ACCGGCCGCA AGCGCAAGCC CCGCTTCTAA GTCACCACGC 900 

GGTTCTGGAA AGGCACTTGC CTCTGCACTT 1TCTATATTG TTCAGCTGAC AAAGTAGTAT 960 

TTTAGAAAAG TTCTAAAGTT ATAAAAATGT TTTCTGCAGT AAAAAAAAAG TTCTCTGGGC 1020 

CGGGCGTGGT GGCTCACANC TGTAATCCCA GCACCTTGGG AGGCTGAGGT GGGAGGATCA 1080 

TTTGAGGCCA GGAGTTTGAG ACCTGCCTGG GCAACATAAT GAAACTTCCT TTCCAGGGAG 1140 

AAAAAAAAAA AAAAAAAAAA ACTCGA 1166 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

AGAGCGGACG AAGCTQGATA ACAGGGGACC GATGATGTGG CGACCATCAG TTCTGCTGCT 60 

TCTGTTGCTA CTGAGGCACG GGGCCCAGGG GAAGCCATCC CCAGACGCAG GCCCTCATGG 120 

CCAGGGGAGG GTGCACCAGG CGGCCCCCCT GAGCGACGCT CCCCATGATG ACGCCCACGG 180 

GAACTTCCAG TACGACCATG AGGCTTTCCT GGGACGGGAA GTGGCCAAGG AATTCGACCA 240 

ACTCACCCCA GAGGAAAGCC AGGCCCGTCT GGGGCGGATC GTGGACCGCA TGGACCGCGC 300 

GGGGGACGGC GACGGCTGGG TGTCGCTGGC CGAGCTTCGC GCGTGGATCG CGCACACGCA 360 

GCAGCGGCAC ATACGGGACT CGGTGAGCGC GGCCTGGGAC ACGTACGACA CGGACCGCGA 420 

CGGGCGTGTG GGTTGGGAGG AGCTGCGCAA CGYCACCTAT GGCCACTASG SGCCCGKTGA 480 

AGAATTTCAT GACGTGGAGG ATGCAGAGAC YTACAAAAAG ATGCTGGYTC GGGACGAGCG 540 
GCGTTTCCGG GTGGCCG&CC AGGATGGGGA CTCGATGGCC ACTCGA 586 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
ACCCATTGAG CAGAAGGAGG CCAGGTGGGA AAGCTCCTGG GAAGAGCAGC CAGACTGGAC 60 



ACTGGGCTGC TTGAGTCCTG AGTCACAATT CAGAATTCCT GGGCTCCCTG GGTGCATTCT 120 



ATCATTCCAG TTGAAAGTTT GCTTCCTTCC AGTCATGTGG CTCTTCATTC TACTCTCCTT 180 



GGCTCTCATT TCAGATGCCA TGGTCATGGA TGAAAAGGTC AAGAGAAGTT TGTGCTGGAC 240 

ACGGCTTCTG CCATCTGCAA CTACAATGCC CAYTACAAGA ATCACCCCAA ATACTGGTGC 300 



CGAGGYTATT TCCGTGAYTA CTGCAACATC ATCGCCTTCT CCCCTAACAG CACCAATCAT 360 



GTGGCCCTGA AGGACACAGG GAACCAGCTC ATTGTCACTA TGTCCTGCCT GAACAAANAA 420 



GACACGGGCT GGTACTGGTG TGGCATCCAR CGGGACTTTG CMAGGGATGA CATGGATTTT 480 



ACAGAGCTGA TTGTAACTGA CGACAAAGGA ACCCTGGCCA ATGACTTTTG GTCTGGGAAA 540 

GACCTATCAG GCAACAAAAC CAGAAGCTGC AAGGCTCCCA AAGTTGTCCG CAAGCTGACC 600 



GCTCCAGGAC GTCCATTCTC ATCATTTGCA TACTGATCAC GGGTTTGGGA ATCATCTCTG 660 



TAATCAGTCA TTTGACCAAA AGGAGGAGAA GTCAAAGGAA TAGAAGGGTA GGCAACACTT 720 



TGAAGCCCTT CTCGCGTGTC CTGACTCCAA AGGAAATGGC TCCTACTGAA CAGATGTGAC 780 



TGAAGWTTTT TTTAATTTAG TTNCATAAAG TGATGNCTAC AACAGAWTAA TCACCCATGA 840 

CAACTGGCCC CACACCTCAG AGACTGATTC TGATCTCCCA GGAATTCTGA AGGACCCTCT 900 



ATCCTTGACA ACAATCATTT GCAGCCAGGT AGCAACGGCR GTAGTCAGAG GAGCTATGAT 960 



AGACCACACC CAAGCAAGGC TGCCCTCAAA TAACATCTCA AGATCTTAGT TCTTATGCAT 1020 



TCCATCAGTC AGAAGTGAAG AAGAGGTGGA GAATCTKGAT TGGGGACCAG GAAATCACTT 1080 



GTATTTTGTT AGCCAATAAA TTCCTAGCCA GTGTTGAATG AAAAAAAAAA AAAA 1134 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

CACTTTAAAG CTCTGCTGAG GGAGTTCGGA GCCCAGGCTT TCAGGCGACC TCTGCCCTCC 60 

CTGCCTCTCC TCACCCTCCC TCTCTTCCTG CAGGGCCTGG GAAGGGCTTT GAGGGAGCCT 120 

GGGAGCCATG TGAAGAGGGG CACGCCTGGG CTGTCCCACA GTTTAGATCC AGTTGGAGGT 180 

TCTCCCTGGC TCCTGCAGGC CTGCGGGGAT CTCTCCCCAC TTCAGGCCTC CGGCAGCTGC 240 

CTGCCCTCTT GTCTGTGCTT CAGCCCTGCA CAAAAGCAGC TTGGTGACAC CACTCAGCCA 300 

CCCAGAGTAC GTGTTTACAG GCTTTCCAGA TCACCTTCCT GTGGGGTGAA CGTAATGAGG 360 

CGGGGCTGGT CCTTGGAATT TCCCCTGGAA AATGGTAACA GACTCCATCC TTGACCCGGG 420 

GATGAGCATG AAGGCATTGT CCCAAAGGCA GAGGCCACCG TGGTAGGAAT TCCACCAAGG 480 

CCAGAAGGGA AAAAGGAAGA ACCCACCGTG TCTGGCTGTG CGGGCCCTGG GGAGGGTCGT 540 

GAGTGCAGCC CCTCTCTACT TCYGTGCCTT TGTAAAACGT GTAGATAACC GCAGTGGTTG 600 

GCTGAGCCAA GAACTCTCCT AAATCAGTGG CTTTCTCCCC ACCCCTTGCT GGGGAGTCAT 660 

TTTTAAAAAA ATCTGTGGGA TATAAAATTG GCCTCCTGCT GCTTCAGCCT ACCTCTCCCT 720 

CTGCTGACTT AATGTCGTGA TTCTGTTTCT TCAGATATTT AAGGCTGTTA GGTTGTGTGA 780 

GCCTTGAAGT GTGTGTGTGT GTCCCAGCGA CTGTCCACTG TCCAGGAGAT GCATGTCTTT 840 

GTATTGGAGA TATTTCTGTA ACTCATTCTC TTGGTGCTCA CGATTGCCAT GGCCATAGGG 900 

CCACAGTGCC GTATCTGCTG CAGACATGAT TGTCTCTTGT TCTAGAGGTT TTCTTGTTIT 960 

CGAATCTTGC CTGATGAATC CAGCCAGACC AAGGGGCCTA GATTTGACCT CTGTCCTGGG 1020 

CTCCTGGGCC AGGTGCAGGA ACATCTGAGG CCACTCTGCT GGCCACCTCC AGTGGGTGCT 1080 

GACCACAGGA TGGGCTTTGT TTACACTCAT TTTCACCCTG ATTCTTGCCC CCACTTTCAT 1140 

AAAAGAAACT TCAAAATGCT GACGCTTTGG AGAGTAAGAA AATCAATCTT GGCTGGGCAC 1200 

GGTGGCTCCT GCCTGTGATC CTAGCACTTT GGGAGGCTGA AGCTGAAGGA TCACTTGAGC 1260 

TCAGGAGTTG GAGACCAACC CTGGCAACAT AACAAGACCC TGTCTCTACA AAAAAAAAAA 1320 

AAAAAAAACT CGA 1333 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1015 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

GGCACGAGCG GCACGAGCGG CACGAGGTGA CTTCAAGTGT CGGATCTTTT CAGCCTACAT 60 
CAAGGAGGTG GAGGAACGGC CGGCACCCAC CCCGTGGGCT CCAAGATGCC CTTTGGGGAA 120 

CTGATGTTCG AATCCAGCAG TAGCTGCGGC TGGGTACATG GCGTCTGTTT CTCAGCCAGC 180 
GGGAGCCGCG TGGCCTGGGT AAGCCACGAC AGCACCGTCT GCCTGGCTGA TGCCGACAAG 240 

AAGATGGCCG TCGCGACTCT GGCCTCTGAA ACACTACCAC TGCTGGCGCT GACCTTCATC 300 

ACAGACAACA GCCTGGTGGC AGCGGGCCAC GACTGCTTCC CGGTGCTGTT CACCTATGAC 360 
GCCGCCGCGG GGATGCTGAG CTTCGGCGGG CGGCTGGACG TTCCTAAGCA GAGCTCGCAG 420 

CGTGGCTTGA CGGCCCGCGA GCGCTTCCAG AACCTGGACA AGAAGGCGAG CTCCGAGGGT 480 



GGCACGGCTG CGGGCGCGGG CCTAGACTCG CTGCACAAGA ACAGCGTCAG CCAGATCTCG 540 



GTGCTCAGCG GCGGCAAGGC CAAGTGCTCG CAGTTCTGCA CCACTGGCAT GGATGGCGGC 600 

ATGAGTATCT GGGATGTGAA GAGCTTGGAG TCAGCCTTGA AGGACCTCAA GATCAAATGA 660 

CCTGTGAGGA ATATGTTGCC TTCATCCTAG CTGCTGGGGA AGCGGGGAGA GGGGTCAGGG 720 

AGGCTAATGG TTGCTTTGCT GAATGTTTCT GGGGTACCAA TACGAGTTCC CATAGGGGCT 780 

GCTCCCTCAA AAAGGGAGGG GACAGATGGG GAGCTTTTCT TACCTATTCA AGGAATACGT 840 

GCCTTTTTCT TAAATGCTTT CATTTATTGA AAAAAAAAAA AAATGCCCCC AAAGCACTAT 900 

GCTGGTCATG AACTGCTTCA AAATGTGGAG GTAATAAAAT GCAACTGTGT AAAAAAAAAA 960 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AACNC 1015 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1076 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
GGCACGAGGG GAAAGCCATG CTCCCAGGAC TCCTTCCTTG CAGCCTTAAA TCGGTCTCTA 60 
CGGAAAATTC CGCGCCTTAG AAACCCACGC TTGGGTGTAA CITA1TA1TG TTCTTCCTGA 120 
CCTACTTCCT GTTTATCACT TCCGGGTTCA TCATTTTGGC ATTTCGGTGA TCGGGTTGGA 180 

ACTATTGAAG CCCGCTTTCA GGTTCTTTTC CCCATTTTCC CTTTGAAAGG AAGACTTCTG 240 
GCTTCTCCTA AATCTCCGTT CTCTGGGTAA GGGGAGTCCA AGCCTCTGTC ATGAGGAACG 300 
GAAATGCGAG GGCCTCGGGT GTTACTCTAA AATCCGCCCT CAGCTTGCAC GCCGGAAGCT 360 
GCGATTCCTG CAGCGGAAGA GGCGTGATCT GGCCTTCGAC TCGCTATGTC CACTAACAAT 420 
ATGTCGGACC CACGGAGGCC GAACAAAGTG CTGAGGTACA AGCCCCCGCC GAGCGAATGT 480 
AACCCGGCCT TGGACGACCC GACGCCGGAC TACATGAACC TGCTGGGCAT GATCTTCAGC 540 
ATGTGCGGCC TCATGCTTAA GCTGAAGTGG TGTGCTTGGG TCGCTGTCTA CTGCTCCTTC 600 
ATCAGCTTTG CCAACTCTCG GAGCTCGGAG GACACGAAGC AAATGATGAG TAGCTTCATG 660 
CTGTCCATCT CTGCCGTGGT GATGTCCTAT CTGCAGAATC CTCAGCCCAT GACGCCCCCA 720 
TGGTGATACC AGCCTAGAAG GGTCACATTT TGGACCCTGT CTATCCACTA GGCCTGQGCT 780 
TTGGCTGCTA AACCTGCTGC CTTCAGCTGC CATCCTGGAC TTCCCTGAAT GAGGCCGTCT 840 
CGGTGCCCCC AGCTGGATAG AGGGAACCTG GCCCTTTCCT AGGGAACACC CTAGGCTTAC 900 
CCCTCCTGCC TCCCTTCCCC TGCCTGCTGC TGGGGGAGAT GCTGTCCATG TTTCTAQGGG 960 
TATTCATTTG CTTTCTCGTT GAAACCTGTT GTTAATAAAG TTTTTCACTC TGAAAAAAAA 1020 
AAAAAAAANA RAAAACNCGN GGGGQGGCCC GGAACCCAAT TCSCCGGATA GTGAGT 1076 



(2) INFORMATION FOR SEQ ID NO: 115: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

CCGCTGCTGA TAACTATGGC ATCCCCCGGG CCTGCAGGAA TTCGGCACGG AGCTACGGCG 60 

CCGCCTGGCT CCTGCTGNCA CCTGCAGGCT CGTCGCGGGT GGAGCCCACC CAAGACATCA 120 

GCATCAGCGA CCAGCTGGGG GGCCAGGACG TGCCCGTGTT CCGGAACCTG TCCCTGCTGG 180 

TGGTGGGTGT CGGCGCCGTG TTCTCACTGC TATTCCACCT GGGCACCCGG GAGAGGCGCC 240 

GGCCGCATGC GGASGAGCCA GGCGAGCACA CCCCCCTGTT GGCCCCTGCC ACGGCCCAGC 300 

CCCTGCTGCT CTGGAAGCAC TGGCTCCGGG AGCSGGCTTT CTACCAGGTG GGCATACTGT 360 

ACATGACCAC CAGGCTCATC GTGAACCTGT CCCAGACCTA CATGGCCATG TACCTCACCT 420 

ACTCGCTCCA CCTGCCCAAG AAGTTCATCG CGACCATTCC CCTGGTGATG TACCTCAGCG 480 

GCTTCTTGTC CTCCTTCCTC ATGAAGCCCA TCAACAAGTG CATTGGGAGG AACATGACCT 540 

ACTTCTCAGG CCTCCTGGTG ATCCTGGCCT TTGCCGCCTG GGTGGCGCTG GCGGAGGGAC 600 

TQGGTGTGGC CGTGTACGCA GCGGCTGTGC TGCTGGGTGC TGGCTGTGCC ACCATCCTCG 660 

TCACCTCGCT GGCCATGACG GCCGACCTCA TCGGTCCCCA CACGAACAGC GGAGCKTTCG 720 

TGTACGGCTC CATGAGCTTC TTGGATAAGG TGGCCAATGG GCTGGCAGTC ATGGCCATCC 780 

AGAGCCTGCA CCXTTGCCCC TCAGAGCTCT GCTGCAGGGC CTGCGTGAGC TTTTACCACT 840 

GGGCGATGGT GGCTGTGACG GGCGGCGTGG GCGTGGCCGC TGCCCTGTGT CTCTGTAGCC 900 

TCCTGCTGTG GCCGACCCGC CTGCGACGCT GATGAGACCT GCACGCANTG GCTCACAGCA 960 

GCACGATTTG TGACAGCCCG AGGCGGAGAA CACCGAACAC CCAGTGAAGG TGAGGGGATC 1020 

AGCACGGCGC GGCCACCCAC GCACCCACGC GCTGGAATGA GACTCAGCCA CAAGGAGGTG 1080 

CGAAGCTCTG ACCCAGGCCA CAGTGCGGAT GCACCTTGAG GATGTCACGC TCAGTGAGAG 1140 

ACACCAGACA CAGAAGGGTA CGCTGTGATC CCACTTCTAT GAAATGTCCA GGACAGACCA 1200 

ATCCACAGAA TCAGGGAGAG GATTCGTGGG TGCOGGGACT GGGGAGGGGG ACCTGGGGGT 1260 

GACTAGGTGA CATAATGGGG ACAGGGCTGC CTTCTGGGTG ATGAGAATGT TCTGGAATCA 1320 

GATGGGATGG CTGCACGGCG TGGTGAAGGT ACTGAACGCC ACCTCACTGT AAGACGGTAG 1380 

ATTTTGTATT TTACCACAAT AAACAAAACA AAACAAAACC AAAAAAAAAA AAAAAAAAAA 1440 

AAAAAAAAGG AATTCGATAT CAAGCTTATC GATACCGTCG ACCTCGA 1487 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 116: 

GGCACGAGTG CGCANGCGTG GGGCTCTCTC CTTGTCAGTC GGCGCCGCGT GCGGGCTGGT 60 

GGCTCTGTGG CAGCGGCGGC GGCAGGACTC CGGCACTATG AGCGGCTTCA GCACCGAGGA 120 

GCGCGCOGCG CCTTCTCCCT GGAGTACCGA GTCTTCCTCA AAAATGAGAA AGGACAATAT 180 

ATATCTCCAT TTCATGATAT TCCAATTTAT GCAGATAAGG ATGTGTTTCA CATGGTAGTT 240 

GAAGTACCAC GCTGGTCTAA TGCAAAAATG GAGATTGCTA CAAAGGACCC TTTAAACCCT 300

ATTAAACAAG ATGTGAAAAA AGGAAAACTT CGCTATGTTG CGAATTTGTT CCCGTATAAA 360 

GGATATATCT GGAACTATGG TGCCATCCCT CAGACTTGGG AAGACCCAGG GCACAATGAT 420 

AAACATACTG GCTGTTGTGG TGACAATGAC CCAATTGATG TGTGTGAAAT TGGAAGCAAG 480

GTATGTGCAA GAGGTGAAAT AATTGGCGTG AAAGTTCTAG GCATATTGGC TATGATTGAC 540 

GAAGGGGAAA CCGACTGGAA AGTCATTGCC ATTAATGTGG ATGATCCTGA TGCAGCCAAT 600 

TATAATGATA TCAATGATGT CAAACGGCTG AAACCTGGCT ACTTAGAAGC TACTGTGGAC 660 

TGGTTTAGAA GGTATAAGGT TCCTGATGGA AAACCAGAAA ATGAGTTTGC GTTTAATGCA 720 

GAATTTAAAG ATAAGGACTT TGCCATTGAT ATTATTAAAA GCACTCATGA CCATTGGAAA 780 

GCATTAGTGA CTAAGAAAAC GAATGGAAAA GGAATCAGTT GCATGAATAC AACTTTGTCT 840 

GAGAGCCCCT TCAAGTGTGA TCCTGATGCT GCCAGAGCCA TTGTGGATGC TTTACCACCA 900

CCCTGTGAAT CTGCCTGCAC AGTACCAACA GACGTGGATA AGTGGTTCCA TCACCAGAAA 960 

AACTAATGAG ATTTCTCTGG AATACAAGCT GATATTGCTA CATCGTGTTC ATCTGGATGT 1020 

ATTAGAAGTA AAAGTAGTAG CTTTTCAAAG CTTTAAATTT GTAGAACTCA TCTAACTAAA 1080 

GTAAATTCTG CTGTGACTAA TCCAATATAC TCAGAATGTT ATCCATCTAA AGCATTTTTC 1140 

ATATCTCAAC TAAGATAACT TTTAGCACAT GCTTAAATAT CAAAGCAGTT GTCATTTGGA 1200

AGTCACTTGT GAATAGATGT GCAAGGGGAG CACATATTGG ATGTATATGT TACCATATGT 1260 

TAGGAAATAA AATTATTTTG CTGAAAAAAA AAAAAAAAAA ACCTSGGGGG GGGSCCCGGT 1320 

CCCCATTTGG CCCTTTGGGG GGNGGTTTTA 1350 



(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2527 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:

CTCTTGCTAC CTTCCCGGCG CAGAGAACCC CGGCTGCTCA GCGCGCTCCG GGGTCATGGA 60 

GATCCCCGGG AGCCTGTGCA AGAAAGTCAA GCTGAGCAAT AACGCGCAGA ACTGGGGAAT 120 

GCAGAGAGCA ACCAATGTCA CCTACCAAGC CCATCATGTC AGCAGGAACA AGAGAGGTCA 180

GGTGGTGGGG ACCAGAGGTG GCTTTCGTGG TTGCACAGTT TGGCTAACAG GCTTGTCTGG 240 

AGCGGGAAAG ACTACTGTGA GCATGGCCTT GGAGGAGTAC CTGGTTTGTC ATGGTATTCC 300 

ATGCTACACT CTGGATGGTG ACAATA1TCG TCAAGGTCTC AATAAAAATC TTGGCTTTAG 360 

TCCTGAAGAC AGAGAAGAGA ATGTTCGACG CATCGCAGAA GTTGCTAAAC TGTTTGCAGA 420 

TGCTGGCTTA GTGTGCATCA CAAGTTTCAT ATCACCTTAC ACTCAGGATC GCAACAATGC 480

AAGGCAAATT CATGAAGGTG CAAGTTTACC GTTTTTTGAA GTATTTGTTG ATGCTCCTCT 540

GCATGTTTGT GAACAGAGGG ATGTCAAAGG ACTCTACAAA AAAGCCCGGG CAGGAGAAAT 600

TAAAGGTTTC ACTGGGATCG ATTCTGAATA TGAAAAGCCA GAGGCCCCTG AGTTGGTGCT 660

GAAAACAGAC TCCTGTGATG TAAATGACTG TGTCCAGCAA GTTGTGGAAC TTCTACAGGA 720

ACGGGATATT GTACCTGTGG ATGCATCTTA TGAAGTAAAA GAACTATATG TGCCAGAAAA 780

TAAACTTCAT TTGGCAAAAA CAGATGCGGA AACATTACCA GCACTGAAAA TTAATAAAGT 840

GGATATGCAG TGGGTGCAGG TTTTQGCAGA AGGTTGGGCA ACCCCATTGA ATGGCTTTAT 900

GAGAGAGAGG GAGTACTTGC AGTGCCTTCA TTTTGATTGT CTTCTGGATG GAGGTGTCAT 960

TAACTTGTCA GTACCTATAG TTCTGACTGC GACTCATGAA GATAAAGAGA GGCTGGACGG 1020

CTGTACAGCA TTTGCTCTGA TGTATGAGGG CCGCCGTGTG GCCATTCTTC GCAATCCAGA 1080

GTTTTTTGAG CACAGGAAAG AGGAGCGCTG TGCCAGACAG TGGGGAACGA CATGCAAGAA 1140

CCACCCCTAT ATTAAGATGG TGATGGAACA AGGAGATTGG CTGATTGGAG GAGATCTTCA 1200

AGTCTTGGAT CGAGTTTATT GGAATGATGG TCTTGATCAG TATCGTCTTA CTCCTACTGA 1260

GCTAAAGCAG AAATTTAAAG ATATGAATGC TGATGCTGTC TTTGCATTTC AACTACGCAA 1320

CCCAGTGCAC AATGGACATG CCCTGTTAAT GCAGGATACC CATAAGCAAC TTCTAGAGAG 1380

GGGCTACCGG CGCCCTGTCC TCCTCCTCCA CCCTCTGGGT GGCTGGACAA AGGATGACGA 1440

TGTTCCTTTG ATGTCGCGTA TGAAGCAGCA TGCTGCAGTG TTGGAGGAAG GAGTTCTGAA 1500

TCCTGAGACG ACAGTGGTGG CCATCTTCCC ATCTCCCATG ATGTATGCTG GACCAACTGA 1560

GGTCCAGTGG CATTGCAGAG CACGGATGGT TGCAGGAGCC AACTTTTACA TTGTTGGACG 1620

AGACCCTGCT GGCATGCCTC ATCCAGAAAC AGGGAAGGAT CTTTATGAGC CAAGTCATGG 1680

TGCCAAAGTG CTGACGATGG CCCCTGGTTT AATCACTTTG GAAATAGTTC CCTTTCGAGT 1740

TGCAGCTTAC AACAAGAAAA AGAAGCGTAT GGACTACTAT GACTCTGAAC ACCATGAAGA 1800

CTTTGAATTT ATTTCAGGAA CACGAATGCG CAAACTTGCT CGAGAAGGCC AGAAACCACC 1860

TGAAGGTTTC ATGGCTCCCA AGGCTTGGAC CGTGCTGACA GAATACTACA AATCCTTGGA 1920

GAAAGCTTAG GCTGTTAACC CAGTCACTCC ACCTTTGACA CATTACTAGT AACAAGAGGG 1980

GACCACATAG TCTCTGTTGG CATTTCTTTG TGGTGTCTGT CTGGACATGC TTCCTAAAAA 2040

CAGACCATTT TCCTTAACTT GCATCAGTTT TGGTCTGCCT TATGAGTTCT GTTTTGAACA 2100

AGTGTAACAC ACTGATGGTT TTAATGTATC TTTTCCACTT ATTATAGTTA TATTCCTACA 2160

ATACAATTTT AAAATTGTCT TTTTATATTA TATTTATGCT TCTGTGTCAT GATTTTTTCA 2220

AGCTGTTATA TTAGTTGTAA CCAGTAGTAT TCACATTAAA TCTTGCTTTT TTTCCCCTTA 2280

AAAAAAGAAA AAAATTACCA AACAATAAAC TTGGCTAGAC CTTGTTTTGA GGATTTTACA 2340

AGACCTCTGT AGCGATTAGA TTTTTTTTCT ACATTGAAAA TAGAAACTGC TTCCTTTCTT 2400

CTTTCCAGTC AGCTATTGGT CTTTCCAGCT GTTATAATCT AAAGTATTCT TATGATCTGT 2460

GTAAGCTCTG AATGAACTTC TTTACTCAAT AAAATTAATT TTTTGGCTTC TTAAAAAAAA 2520

AAAAAAA 2527



(2) INFORMATION FOR SEQ ID NO: 118:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1098 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:

CGCATCACAG ACAACCCAGA AGGAAAATGG TTGGGCAGAA CAGCAAGGGG TTCATATGGC 60

TATATTAAAA CAACTGCTGT AGAGATTNNC TATGATTCTT TGAAACTGAA AAAAGACTCT 120 

CTTGGTGCCC CTTCAAGACC TATTGAAGAT GACCAAGAAG TATATGATGA TGTTGCAGAG 180 

CAGGATGATA TTAGCAGCCA CAGTCAGAGT GGAAGTGGAG GGATATTCCC TCCACCACCA 240 

GATGATGACA TTTATGATGG GATTGAAGAG GAAGATGCTG ATGATGGTTT CCCTGCTCCT 300 

CCTAAACAAT TGGACATGGG AGATGAAGTT TACGATGATG TGGATACCTC TGATTTCCCT 360

GTTTCATCAG CAGAGATGAG TCAAGGAACT AATGTTGGAA AAGCTAAGAC AGAAGAAAAG 420 

GACCTTAAGA AGCTAAAAAA GCAGRAAAAA GAARAAAAAG ACTTCAGGAA AAAATTTAAA 480 

TATGATGGTG AAA1TAGAGT CCTATATTCA ACTAAAGTTA CAACTTCCAT AACTTCTAAA 540 

AAGTGGGGAA CCAGAGATCT ACAGGTAAAA CCTGGTGAAT CTCTAGAAGT TATACAAACC 600 

ACAGATGACA CAAAAGTTCT CTGCAGAAAT GAAGAAGGGA AATATGGTTA TGTCCTTCGG 660

AGTTACCTAG CGGACAATGA TGGAGAGATC TATGATGATA TTGCTGATGG CTGCATCTAT 720 

GACAATGACT AGCACTCAAC TTTGGTCATT CTGCTGTGTT CATTAGGTGC CAATGTGAAG 780 

TCTGGATTTT AATTGGCATG TTATTGGGTA TCMAGAAAAT TAATGCACAR AACCACTTAT 840 

TATCATTTGT TATGAAATCC CAATTATCTT TACAAAGTGT TTAAAGTTTG AACATAGAAA 900 

ATAATCTCTC TGCTTAATTG TTATCTCAGA AGACTACATT AGTGAGATGT AAGAATTATT 960

AAATATTCCA TTTCCGCTTT GGCTACAATT ATGAAGAAGT TGAAGGTACT TCTTTTAGAC 1020 

CACCAGTAAA TAATCCTCCT TCAAAAAATA AAAATAAAAA AAAAAAAAAA AAACTCGAGG 1080 

GGGGGCCCGG TACCCAAT 1098



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1679 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:

TCGACCCACG CGTCCGGCGA GATCCCTACC GCAGTAGCCG CCTCTGCCGC CGCGGAGCTT 60 

CCCGAACCTC TTCAGCCGCC CGGAGCCGCT CCCGGAGCCC GGCCGTAGAG GCTGCAATCG 120 

CAGCCGGGAG CCCGCAGCCC GCGCCCCGAG CCCGCCGCCG CCCTTCGAGG GCGCCCCAGG 180

CCGCGCCATG GTGAAGGTGA CGTTCAACTC CGCTCTGGCC CAGAAGGAGG CCAAGAAGGA 240

CGAGCCCAAG AGCGGCGAGG AGGCGCTCAT CATCCCCCCC GACGCCGTCG CGGTGGACTG 300

CAAGGACCCA GATGATGTGG TACCAGTTGG CCAAAGAAGA GCCTGGTGTT GGTGCATGTG 360 

CTTTGGACTA GCATTTATGC TTGCAGGTGT TATTCTAGGA GGAGCATACT TGTACAAATA 420 

TTTTGCACTT CAACCAGATG ACGTGTACTA CTGTGGAATA AAGTACATCA AAGATGATGT 480

CATCTTAAAT GAGCCCTCTG CAGATGCCCC AGCTGCTCTC TACCAGACAA TTGAAGAAAA 540 

TATTAAAATC TTTGAAGAAG AAGAAGTTGA ATTTATCAGT GTGCCTGTCC CAGAGTTTGC 600

AGATAGTGAT CCTGCCAACA TTGTTCATGA CTTTAACAAG AAACTTACAG CCTATTTAGA 660 

TCTTAACCTG GATAAGTGCT ATGTGATCCC TCTGAACACT TCCATTGTTA TGCCACCCAG 720 

AAACCTACTG GAGTTACTTA TTAACATCAA GGCTGGAACC TATTTGCCTC AGTCCTATCT 780

GATTCATGAG CACATGGTTA TTACTGATCG CATTGAAAAC ATTGATCACC TGGGTTTCTT 840 

TATTTATCGA CTGTGTCATG ACAAGGAAAC TTACAAACTG CAACGCAGAG AAACTATTAA 900 

AGGTATTCAG AAACGTGAAG CCAGCAATTG 1TTCGCAATT CGGCATTTTG AAAACAAATT 960 

TGCCGTGGAA ACTTTAATTT GTTCTTGAAC AGTCAAGAAA AACATTATTG AGGAAAATTA 1020 

ATATCACAGC ATAACCCCAC CCTTTACATT TTGTGCAGTG ATTATTTTTT AAAGTCTTCT 1080

TTCATGTAAG TAGCAAACAG GGCTTTACTA TCTTTTCATC TCATTAATTC AATTAAAACC 1140

ATTACCTTAA AATTTTTTTC TTTCGAAGTG TGGTGTCTTT TATATTTGAA TTAGTAACTG 1200

TATGAAGTCA TAGATAATAG TACATGTCAC CTTAGGTAGT AGGAAGAATT ACAATTTCTT 1260 

TAAATCATTT ATCTGGATTT TTATGTTTTA TTAGCATTTT CAAGAAGACG GATTATCTAG 1320 

AGAATAATCA TATATATGCA TACGTAAAAA TGGACCACAG TGACTTATTT GTAGTTGTTA 1380

GTTGCCCTGC TACCTAGTTT GTTAGTGCAT TTGAGCACAC ATTTTAATTT TCCTCTAATT 1440

AAAATGTGCA GTATTTTCAG TGTCAAATAT ATTTAACTAT TTAGAGAATG ATTTCCACCT 1500

TTATGTTTTA ATATCCTAGG CATCTGCTGT AATAATATTT TAGAAAATGT 1TGGAATTTA 1560

AGAAATAACT TGTGTTACTA ATTTGTATAA CCCATATCTG TGCAATGGAA TATAAATATC 1620

ACAAAGTTGT TTAAMWAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAN 1679

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1308 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

TTGGCANCNG GGAGAGGGAA AGAGGAGGAA ATGGGGTTTG AGGACCATGG CTTACCTTTC 60

CTGCCTTTGA CCCATCACAC CCCATTTCCT CCTCTTTCCC TCTCCCCGCT GCCAAAAAAA 120

AAAAAAAAGG AAACGTTTAT CATGAATCAA CAGGGTTTCA GTCCTTATCA AAGAGAGATG 180

TGGAAAGAGC TAAAGAAACC ACCCTTTGTT CCCAACTCCA CTTTACCCAT ATTTTATGCA 240

ACACAAACAC TGTCCTTTTG GGTCCCTTTC TTACAGATGG ACCTCTTGAG AAGAATTATC 300

GTATTCCACG TTTTTAGCCC TCAGGTTACC AAGATAAATA TATGTATATA TAACCTTTAT 360

TATTGCTATA TCTTTGTGGA TAATACATTC AGGTGGTGCT GGGTGATTTA TTATAATCTG 420

AACCTAGGTA TATCCTTTGG TCTTCCACAG TCATGTTGAG GTGGGCTCCC TGGTATGGTA 480

AAAAGCCAGG TATAATGTAA CTTCACCCCA GCCTTTGTAC TAAGCTCTTG ATAGTGGATA 540

TACTCTTTTA AGTTTAGCCC CAATATAGGG TAATGGAAAT TTCCTGCCCT CTGGGTTCCC 600

CATTTTTACT ATTAAGAAGA CCAGTGATAA TTTAATAATG CCACCAACTC TGGCTTAGTT 660

AAGTGAGAGT GTGAACTGTG TGGCAAGAGA GCCTCACACC TCACTAGGTG CAGAGAGCCC 720

AGGCCTTATC TTAAAATCAT GCACTTGAAA AGCAAACCTT AATCTGCAAA GACAGCAGCA 780

AGCATTATAC GGTCATCTTG AATGATCCCT TTGAAATTTT TTTTTTGTTT GTTTGTTTAA 840

ATCAAGCCTG AGGCTGGTGA ACAGTAGCTA CACACCCATA TTGTGTGTTC TGTGAATGCT 900

AGCTCTCTTG AATTTGGATA TTGGTTATTT TTTATAGAGT GTAAACCAAG TTTTATATTC 960

TGCAATGCGA ACAGGTACCT ATCTGTTTCT AAATAAAACT GTTTACATTC ATTATGGGGT 1020

ATGTATGACC TTCATTTTCC AAGAAATAGA ACTCTAGCTT AGAATTATGG ATGCTCTAAA 1080

ATGTCAGAAT GGGAACTCTC CTCGAAGTTC TCCCAAACTC AGAGACAGCA CTGCCTTCTC 1140

CTAAATGATT ATTCTTTTCT CCCTGTTTTC TGGTATTTTC TAGGCATCCT TCTCACCACA 1200 

GCCATAACCC TTTTTTACTT CCATTAGGCC GTATAACTGG NGGGACNGCT GGTCGGTATA 1260 

TAATACTGGT WCCAACAMAG GGGTTCTGGA TGTACACMAG GTTATCTT 1308 

(2) INFORMATION FOR SEQ ID NO: 121: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1411 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:

GGCACAGGAG CGACCCGGGA GAAGGAGGGC CAMGAKGCGG AAGCGGAGGA GTCTCCAGGA 60 

GACCCGGGGA CAGCATCGCC CAGGCCCCTG TTTGCAGGCC TTTCAGATAT ATCCATCTCA 120

CAAGACATCC CCGTAGAAGG AGAAATCACC ATTCCTATGA GATCTCGCAT CCGGGAGTTT 180 

GACAGCTCCA CATTAAATGA ATCTGTTCGC AATACCATCA TGCGTGATCT AAAAGCTGTT 240 

GGGAAAAAAT TCATGCATGT TTTGTACCCA AGGAAAAGTA ATACTCTTTT GAGAGATTGG 300

GATTTGTGGG GCCCTTTGAT CCTTTGTGTG ACACTCGCAT TAATGCTGCA AAGAGACTCT 360 

GCAGATAGTG AAAAAGATGG AGGGCCCCAA TTTGCAGAGG TGTTTGTCAT TGTCTGGTTT 420

GGTGCAGTTA CCATCACCCT CAACTCAAAA CTTCTTGGAG GGAACATATC TTTTTTTCAG 480 

AGCCTCTGTG TGCTGGGTTA CTGTATACTT CCCTTGACAG TAGCAATGCT GATTTGCCGG 540 

CTGGTACTTT TGGCTGATCC AGGACCTGTA AACTTCATGG TTCGGCITTT TGTGGTGA1T 600

GTGATGTTTG CCTGGTCTAT AGTTGCCTCC ACAGCTTTCC TTGCTGATAG CCAGCCTCCA 660 

AACCGCAGAG CCCTAGCTGT TTATCCTGTT TTCCTGTTTT ACTTTGTCAT CAGTTGGATG 720 

ATTCTCACCT TTACTCCTCA GTAAATCAGG AATGGGAAAT TAAAAACCAG TGAATTGAAA 780 

GCACATCTGA AAGATGCAAT TCACCATGGA GCTTTGTCTC TGGCCCTTAT TTGTCTAA1T 840 

TTGGAGGTAT TTGATAACTG AGTAGGTGAG GAGATTAAAA GGGAGCCATA TAGCACTGTC 900

ACCCCTTATT TGAGGAACTG ATGTTTGAAA GGCTGTTCTT TTCTCTCTTA ATGTCATTTC 960 

TTTAAAAATA CATGTGCATA CTACACACAG TATATAATGC CTCCTTAAGG CATGATGGAG 1020

TCACCGTGGT CCATTTGGGT GACAACCAGT GACTTGGGAA GCACATAGAT ACATCTTACA 1080 

AGTTGAATAG AGTTGATAAC TATTTTCAGT TTTGAGAATA CCAGTTCAGG TGCAGCTCTT 1140 

AAACACATTG CCTTATGACT ATTAGAATAT GCCTCTCTTT TCATAAATAA AAATACATGG 1200

TCTATATCCA TTTTCTTTTA TTTCTCTCTC TTAAGCTTAA AAAGGCAATG AGAGAGGTTA 1260

GGAGTGGGTT CATACACGGA GAATGAGAAA ACATGCATTA ACCAATATTC AGATTTTGAT 1320 

CAGGGGAAAT TCTAYACTTG TTGCAAAAAA AAAAAAAAAA AAACTCGAGG GGGGCCCGGT 1380 

ACCCAATCGC NGTATATGAT CGNAAACAAT C 1411 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2256 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:
GCTTTGGCTT TTTTTGGCGG ACTGGGGCGC CCTCCGGAAG CGTTTCCAAC TTTCCAGAAG 60 
TTTCTCGGGA CGGGCAGGAG GGGGTGGGGA CTGCCATATA TAGATCCCGG GAGCAGGGGA 120
GCGGGCTAAG AGTAGAATCG TGTCGCGGCT CGAGAGCGAG AGTCACGTCC CGGCGCTAGC 180 
CAGCCCGACC CAGGCCCACC GTGGTGCACG CAAACCACTT CCTGGCCATG CGCTCCCTCC 240 

TGCTTCTCAG CGCCTTCTGC CTCCTGGAGG CGGCCCTGGC CGCCGAGGTG AAGAAACCTG 300 
CAGCCGCAGC AGCTCCTGGC ACTGCGGAGA AGTTGAGCCC CAAGGCGGCC ACGCTTGCCG 360 
AGCGCANGCC GGCCTGGCCT TCAGCTTGTA CCAGGCCATG GCCAAGGACC AGGCAGTGGA 420
GAACATCCTG GTGTCACCCG TGGTGGTGGC CTCGTCGCTG GGGCTCGTGT CGCTGGGCGG 480
CAAGGCGACC ACGGCGTCGC AGGCCAAGGC AGTGCTGAGC GCCGAGCAGC TGCGCGACGA 540

GGAGGTGCAC GCCGGCCTGG GCGAGCTGCT GCGCTCACTC AGCAACTCGA CGGCGCGCAA 600 
CGTGACCTGG AAGCTGGGCA GCCGACTGTA CGGACCCAGC TCAGTGAGCT TCGCTGATGA 660 
CTTCGTGCGC ACAGCAAGCA GCACTACAAC TGCGAGCACT CCAAGATCAA CTTCCGCGAC 720
AAGCGCAGNG CGCTGCAGTC CATCAACGAG TGGGCCGCGC AGACCACCGA CGGCAAGCTG 780 
CCCGAGGTCA CCAAGGACGT GGAGCGCACG GACGGCGCCC TGCTAGTCAA CGCCATGTTC 840 

TTCAAGCCAC ACTGGGATGA GAAATTCCAC CACAAGATGG TGGACAACOG TCGCTTCATG 900 
GTGACTCGGT CCTATACYGT GGGTGTCATG ATGATGCACC GGACAGGCCT CTACAACTAC 960 
TACGACGACG AGAAGGAAAA GCTGCAAATC GTGGAGATGC CCCTGGCCCA CAAGCTCTCC 1020
AGCCTCATCA TCCTCATGCC CCATCACGTG GAGCCTCTCG AGCGCCTTGA AAAGCTGCTA 1080 
ACCAAAGAGC AGCTGAAGAT CTGGATGGGG AAGATGCAGA AGAAGGCTGT TGCCATCTCC 1140 

TTGCCCAAGG GTGTGGTGGA GGTGACCCAT GACCTGCAGA AACACCTGGC TGGGCTGGGC 1200

CTGACTGAGG CCATTGACAA GAACAAGGCC GACTTRTCAC GCATGTCAGG CAAGAAGGAC 1260 

CTGTACCTGG CCAGCGTGTT CCACGCCACC GCCTTTGAGT TGGACACAGA TGGCAACCCC 1320

TTTGACCAGG ACATCTACGG GCGCGAGGAG CTGCGCANCC CAAGCTGTTC TACGCCGACC 1380 

ACCCCTTCAT CTTCCTAGTG CGGGACACCC AAAGCGGCTC CCTGCTATTC ATTGGGCGCC 1440 

TGGTCCGGCC TAAGGGTGAC AAGATGCGAG ACGAGTTATA GGGCCTCAGG GTGCACACAG 1500 

GATGGCAGGA GGCATCCAAA GGCTCCTGAG ACACATGGGT GCTATTGGGG TTGGGGGGGA 1560 

GGTGAGGTAC CAGCCTTGGA TACTCCATGG GGTGGGGGTG GAAAARCAGA CCGGGGTTCC 1620

CGTGTGCCTG AGCGGACCTT CCCAGCTAGA ATTCACTCCA CTTGGACATG GGCCCCAGAT 1680 

ACCATGATGC TGAGCCCGGA AACTCCACAT CCTGTGGGAC CTGGGCCATA GTCATTCTGC 1740 

CTGCCCTGAA AGTCCCAGAT CAAGCCTGCC TCAATCAGTA TTCATATTTA TAGCCAGGTA 1800 

CCTTCTCACC TGTGAGACCA AATTGAGCTA GGGGGGTCAG CCAGCCCTCT TCTGACACTA 1860

AAACACCTCA GCTGCCTCCC CAGCTCTATC CCAACCTCTC CCAACTATAA AACTAGGTGC 1920

TGCAGCCCCT GGGACCAGGC ACCCCCAGAA TGACCTGGCC GCAGTGAGGC GGATTGAGAA 1980 

GGAGCTCCCA GGAGGGGCTT CTGGGCAGAC TCTGGTCAAG AAGCATCGTG TCTGGCGTTG 2040 

TGGGGATGAA CTTTTTGTTT TGTTTCTTCC TTTTTTAGTT CTTCAAAGAT AGGGAGGGAA 2100 

GGGGGAACAT GAGCCTTTGT TGCTATCAAT CCAAGAACTT ATTTGTACAT TTTTTTTTTC 2160 

AATAAAACTT TTCCAATGAC AAAAAAAAAA AAAAAAAAAA AAAAAGGGGS GGGCCGCTCC 2220

TAGAGGGATC CCTCCGANGG NGCCCAATCG AAAATN 2256 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 829 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:

ATGCGCTCCC TCCTGCTTCT CAGCGCCTTC TGCCTCCTGG AGGCGGCCCT GGCCGCCGAG 60 

GTGAAGAAAC CTGCAGCCGC AGCAGCTCCT GGCACTGCGG AGAAGTTGAG CCCCAAGGCG 120 

GCCACGCTTG CCGAGCGCAA GCGGCCTGGC CTTCAGCTTG TACCAGGCCA TGGCCAAGGA 180 

CCAGGCAGTG GAGAACATCC TGGTGTCACC CGTGGTGGTG GCCTCGTCGC TGGGGCTCGT 240

GTCGCTGGGC GGCAAGGCGA CCACGGCGTC GCAGGCCAAG GCAGTGCTGA GCGCCGAGCA 300

GCTGCGCGAC GAGGAGGTGC ACGCCGGCCT GGGCGAGCTG CTGCGCTCAC TCAGCAACTC 360

CACGGCGCGC AACGTGACCT GGAAGCTGGG CAGCCGACTG TACGGACCCA GCTCAGTGAG 420 

CTTCGCTGAT GACTTCGTGC GCAGCAGCAA GCAGCACTAC AACTGCGAGC ACTCCAAGAT 480 

CAACTTCCGC GACAAGCGCA GCGCGCTGCA GTCCATCAAC GAGTGGGCCG CGCAGACCAC 540 

CGACGGCAAG CTGCCCGAGG TCACCAAGGA CGTGGAGCGC ACGGACGGCG CCCTGTTAGT 600

CAACGCCATG TTCTTCAAGC CACACTGGGA TGAGAAATTC CACCACAAGA TGGTGGACAA 660 

CCGTGGCTTC ATGGTGACTC GGTCCTATAC CGTGGGTGTC ATGATGATGC ACCGGACAGG 720 

CCTCTACAAC TACTACGACG ACGAGAAGGA AAAGCTGCAA ATCGTGGAGA TGCCCCTGGC 780 

CCACAAGCTC TCCAGCCTCA TCATCCTCAT GCCCCATCAC GTGGAGCCT 829 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2223 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:
CCTCCGGAAG CGTTTCCAAC TTTCCAGAAG TTTCTCGGGA CGGGCAGGAG GGGGTGGGGA 60 
CTGCCATATA TAGATCCCGG GAGCAGGGGA GCGGGCTAAG AGTAGAATCG TGTCGCGGCT 120
CGAGAGCGAG AGTCACGTCC CGGCGCTAGC CAGCCCGACC CAGGCCCACC GTGGTGCACG 180 
CAAACCACTT CCTGGCCATG CGCTCCCTCC TGCTTCTCAG CGCCTTCTGC CTCCTGGAGG 240 

CGGCCCTGGC CGCCGAGGTG AAGAAACCTG CAGCCGCAGC AGCTCCTGGC ACTGCGGAGA 300 
AGTTGAGCCC CAAGGCGGCC ACGCTTGCCG AGCGCAGNCG GCCTGGCCTT CAGCTTGTAC 360 
CAGGCCATGG CCAAGGACCA GGCAGTGGAG AACATCCTGG TGTCACCCGT GGTGGTGGCC 420
TCGTCGCTGG GGCTCGTGTC GCTGGGCGGC AAGGCGACCA CGGCGTCGCA GGCCAAGGCA 480 
GTGCTGAGCG CCGAGCAGCT GCGCGACGAG GAGGTGCACG CCGGCCTGGG CGAGCTGCTG 540 

CGCTCACTCA GCAACTCSAC GGCGCGCAAC GTGACCTGGA AGCTGGGCAG CCGACTGTAC 600 
GGACCCAGCT CAGTGAGCTT CGCTGATGAC TTCGTGCGCA CAGCAAGCAG CACTACAACT 660 
GCGAGCACTC CAAGATCAAC TTCCGCGACA AGCGCACGCG CTGCAGTCCA TCAACGAGTG 720
GGCCGCGCAG ACCACCGACG GCAAGCTGCC CGAGGTCACC AAGGACGTGG AGCGCACGGA 780
CGGCGCCCTG YTAGTCAACG CCATGTTCTT CAAGCCACAC TGGGATGAGA AATTCCACCA 840
CAAGATGGTG GACAACCGTG GCTTCATGGT GACTCGGTCC TATACYGTGG GTGTCATGAT 900

GATGCACCGG ACAGGCCTCT ACAACTACTA CGACGACGAG AAGGAAAAGC TGCAAATCGT 960 

GGAGATGCCC CTGGCCCACA AGCTCTCCAG CCTCATCATC CTCATGCCCC ATCACGTGGA 1020

GCCTCTCGAG CGCCTTGAAA AGCTGCTAAC CAAAGAGCAG CTGAAGATCT GGATGGGGAA 1080 

GATGCAGAAG AAGGCTGTTG CCATCTCCTT GCCCAAGGGT GTGGTGGAGG TGACCCATGA 1140 

CCTGCAGAAA CACCTGGCTG GGCTGGGCCT GACTGAGGCC ATTGACAAGA ACAAGGCCGA 1200 

CTTRTCACGC ATGTCAGGCA AGAAGGACCT GTACCTGGCC AGCGTGTTCC ACGCCACCGC 1260 

CTTTGAGTTG GACACAGATG GCAACCCCTT TGACCAGGAC ATCTACGGGC GCGAGGAGCT 1320

GCGCASCCCA AGCTGTTCTA CGCCGACCAC CCCTTCATCT TCCTAGTGCG GGACACCCAA 1380 

AGCGGCTCCC TGCTATTCAT TGGGCGCCTG GTCCGGCCTA AGGGTGACAA GATGCGAGAC 1440 

GAGTTATAGG GCCTCAGGGT GCACACAGGA TGGCAGGAGG CATCCAAAGG CTCCTGAGAC 1500 

ACATGGGTGC TATTGGGGTT GGGGGGGAGG TGAGGTACCA GCCTTGGATA CTCCATGGGG 1560 

TGGGGGTGGA AAARCAGACC GGGGTTCCCG TGTGCCTGAG CGGACCTTCC CAGCTAGAAT 1620

TCACTCCACT TGGACATGGG CCCCAGATAC CATGATGCTG AGCCCGGAAA CTCCACATCC 1680 

TGTGGGACCT GGGCCATAGT CATTCTGCCT GCCCTGAAAG TCCCAGATCA AGCCTGCCTC 1740 

AATCAGTATT CATATTTATA GCCAGGTACC TTCTCACCTG TGAGACCAAA TTGAGCTAGG 1800 

GGGGTCAGCC AGCCCTCTTC TGACACTAAA ACACCTCAGC TGCCTCCCCA GCTCTATCCC 1860 

AACCTCTCCC AACTATAAAA CTAGGTGCTG CAGCCCCTGG GACCAGGCAC CCCCAGAATG 1920

ACCTGGCCGC AGTGAGGCGG ATTGAGAAGG AGCTCCCAGG AGGGGCTTCT GGGCAGACTC 1980 

TGGTCAAGAA GCATCGTGTC TGGCGTTGTG GGGATGAACT TTTTGTTTTG TTTCTTCCTT 2040 

TTTTAGTTCT TCAAAGATAG GGAGGGAAGG GGGAACATGA GCCTTTGTTG CTATCAATCC 2100 

AAGAACTTAT TTGTACATTT TTTTTTTCAA TAAAACTTTT CCAATGACAA AAAAAAAAAA 2160 

AAAAAAAAAA MWMGGGGSGG GCCGCTCCTA GAGGGATCCC TCCGANGGNG CCCAATCGAA 2220

AAT 2223 





(2) INFORMATION FOR SEQ ID NO: 125: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

Met Lys Lys Gln Ser Lys Arg Cys Leu Trp Lys Pro Pro Gly Ser Leu
1 5 10 15

Arg Arg Leu Trp Trp Met Arg Ala Leu Leu Ile Leu Lys Tyr Ile
20 25 30



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

Met Lys Lys Ser Leu Glu Asn Leu Asn Arg Leu Gln Val Met Leu Leu
1 5 10 15

His Leu Thr Ala Ala Phe Leu Gln Arg Ala His Xaa Ile Leu Thr Thr
20 25 30

Arg Met Ser Leu Gly Phe Gln Ser Pro His Leu Thr Met
35 40 45



(2) INFORMATION FOR SEQ ID NO: 127: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:

Met His Asn Gln Arg Gln Val Phe Leu Phe His Leu Phe Ser Asn Tyr
1 5 10 15

Leu Leu Ser Ile Asn Ser Val Pro Gly Thr Leu Leu Ala Ala Thr Tyr
20 25 30

Cys Leu Asn Met Thr Tyr Gly
35

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:

Met Arg Lys Lys Phe Leu Leu Ala Gln Val Phe Leu Ser Leu Ser Val
1 5 10 15

Met Pro Ser Met Pro Val Thr
20

(2) INFORMATION FOR SEQ ID NO: 129:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 110 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

Met Val Leu Leu Cys Leu Leu Leu Val Pro Leu Leu Leu Ser Leu Phe
1 5 10 15

Val Leu Gly Leu Phe Leu Trp Phe Leu Lys Arg Glu Arg Gln Glu Glu
20 25 30

Tyr Ile Glu Glu Lys Lys Arg Val Asp Ile Cys Arg Glu Thr Pro Asn
35 40 45

Ile Cys Pro His Ser Gly Glu Asn Thr Glu Tyr Asp Thr Ile Pro His
50 55 60

Thr Asn Arg Thr Ile Leu Lys Glu Asp Pro Ala Asn Thr Val Tyr Ser
65 70 75 80

Thr Val Glu Ile Pro Lys Lys Met Glu Asn Pro His Ser Leu Leu Thr
85 90 95

Met Pro Asp Thr Pro Arg Leu Phe Ala Tyr Glu Asn Val Ile
100 105 110

(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 63 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:

Met Leu Leu Leu Phe Ile Tyr Phe Tyr Ser His Pro Ala Pro Val Pro
1 5 10 15

Ala Gly Ala Thr Ser Lys Pro Arg Tyr Arg Val Ile Thr Cys Gly Pro
20 25 30

Ala Ser Val Phe Ser Thr Ser Phe Ser His Ser Pro Pro Ala Arg Cys
35 40 45

Leu Gly Arg Leu Glu Gln Met Phe His Phe Gly Leu Ala Ser Gly
50 55 60



(2) INFORMATION FOR SEQ ID NO: 131:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear



WO 98/42738 



PCT/US98/05311 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:

Met Pro Phe Pro Ile Ser Ile Leu Gln Leu Cys Leu Gln Ile Ser Asn
1 5 10 15

Leu Ser Phe Cys Leu Gln Lys Ile Tyr Lys Ile Pro Phe Val
20 25 30



(2) INFORMATION FOR SEQ ID NO: 132: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 53 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:

Met Ala Ala Ala Cys Arg Ser Val Lys Gly Leu Val Ala Val Ile Thr
1 5 10 15

Gly Gly Ala Ser Gly Leu Gly Leu Ala Thr Ala Asp Asp Leu Trp Gly
20 25 30

Arg Glu Pro Leu Leu Cys Phe Trp Thr Cys Pro Thr Arg Val Gly Arg
35 40 45

Pro Lys Pro Arg Ser
50



(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 57 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

Met Leu Leu Val Tyr Asp Leu Tyr Leu Xaa Pro Lys Leu Trp Ala Leu
1 5 10 15

Ala Thr Pro Gln Lys Asn Gly Lys Gly Ala Arg Xaa Gly Asp Gly Thr
20 25 30

Pro Ala Gln Ala Phe Trp Asp Phe Trp Ser His Leu Ile Ser Ala Asp
35 40 45

Pro Gln Thr Trp Glu Arg Ala Ala Pro
50 55



(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 216 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:

Met Arg Leu Ser Ala Leu Leu Ala Leu Ala Ser Lys Val Thr Leu Pro
1 5 10 15

Pro His Tyr Arg Tyr Gly Met Ser Pro Pro Gly Ser Val Ala Asp Lys
20 25 30

Arg Lys Asn Pro Pro Trp Ile Arg Arg Arg Pro Val Val Val Glu Pro
35 40 45

Ile Ser Asp Glu Asp Trp Tyr Leu Phe Cys Gly Asp Thr Val Glu Ile
50 55 60

Leu Glu Gly Lys Asp Ala Gly Lys Gln Gly Lys Val Val Gln Val Ile
65 70 75 80

Arg Gln Arg Asn Trp Val Val Val Gly Gly Leu Asn Thr His Tyr Arg
85 90 95

Tyr Ile Gly Lys Thr Met Asp Tyr Arg Gly Thr Met Ile Pro Ser Glu
100 105 110

Ala Pro Leu Leu His Arg Gln Val Lys Leu Val Asp Pro Met Asp Arg
115 120 125

Lys Pro Thr Glu Ile Glu Trp Arg Phe Thr Glu Ala Gly Glu Arg Val
130 135 140

Arg Val Ser Thr Arg Ser Gly Arg Ile Ile Pro Lys Pro Glu Phe Pro
145 150 155 160

Arg Ala Asp Gly Ile Val Pro Glu Thr Trp Ile Asp Gly Pro Lys Asp
165 170 175

Thr Ser Val Glu Asp Ala Leu Glu Arg Thr Tyr Val Pro Cys Leu Lys
180 185 190

Thr Leu Gln Glu Glu Val Met Glu Ala Met Gly Ile Lys Glu Thr Arg
195 200 205

Lys Tyr Lys Lys Val Tyr Trp Tyr
210 215

(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 49 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

Met Ser Leu Arg Gln Lys Ser Ser Phe Arg Leu Met Val Met Ser Leu
1 5 10 15

Thr Ile Leu Lys Leu Ser Lys Thr Thr Val Leu Cys Leu Arg Cys Leu
20 25 30

His Ser Leu Lys Leu Thr Trp Arg Asp Gly Ala Arg Cys Ile Asn Ala
35 40 45

Glu



(2) INFORMATION FOR SEQ ID NO: 136: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 68 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:

Met Ser Gly Ser Phe Ile Leu Cys Leu Ala Leu Val Thr Arg Trp Ser
1 5 10 15

Pro Gln Ala Ser Ser Val Pro Leu Ala Val Tyr Glu Ser Lys Thr Arg
20 25 30

Lys Ser Tyr Arg Ser Gln Arg Asp Arg Asp Gly Lys Asp Arg Ser Gln
35 40 45

Gly Met Gly Leu Ser Leu Leu Val Glu Thr Arg Lys Leu Leu Leu Ser
50 55 60

Ala Asn Gln Gly
65



(2) INFORMATION FOR SEQ ID NO: 137: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

Met Cys Phe Arg Phe Phe Leu Phe Cys Ser Arg Ile Leu Leu Lys Leu
1 5 10 15

Phe Phe Leu Leu Phe Pro Ala Ser Ala Phe Pro Leu Ser Thr Arg Ser
20 25 30

Ser Leu Ser Val Asn Glu His Val Val Val Ser Pro Arg Ser Thr Val
35 40 45

Ser Ile Ser Arg
50

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 541 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

Met Val Arg Thr Asp Gly His Thr Leu Ser Glu Lys Arg Asn Tyr Gln
1 5 10 15

Val Thr Asn Ser Met Phe Gly Ala Ser Arg Lys Lys Phe Val Glu Gly
20 25 30

Val Asp Ser Asp Tyr His Asp Glu Asn Met Tyr Tyr Ser Gln Ser Ser
35 40 45

Met Phe Pro His Arg Ser Glu Lys Asp Met Leu Ala Ser Pro Ser Thr
50 55 60

Ser Gly Gln Leu Ser Gln Phe Gly Ala Ser Leu Tyr Gly Gln Gln Ser
65 70 75 80

Ala Leu Gly Leu Pro Met Arg Gly Met Ser Asn Asn Thr Pro Gln Leu
85 90 95

Asn Arg Ser Leu Ser Gln Gly Thr Gln Leu Pro Ser His Val Thr Pro
100 105 110

Thr Thr Gly Val Pro Thr Met Ser Leu His Thr Pro Pro Ser Pro Ser
115 120 125

Arg Gly Ile Leu Pro Met Asn Pro Xaa Asn Met Met Asn His Ser Gln
130 135 140

Val Gly Gln Gly Ile Gly Ile Pro Ser Arg Thr Asn Ser Met Ser Ser
145 150 155 160

Ser Gly Leu Gly Ser Pro Asn Arg Ser Ser Pro Ser Ile Ile Cys Met
165 170 175

Pro Lys Gln Gln Pro Ser Arg Gln Pro Phe Thr Val Asn Ser Met Ser
180 185 190

Gly Phe Gly Met Asn Arg Asn Gln Ala Phe Gly Met Asn Asn Ser Leu
195 200 205

Ser Ser Asn Ile Phe Asn Gly Thr Asp Gly Ser Glu Asn Val Thr Gly
210 215 220

Leu Asp Leu Ser Asp Phe Pro Ala Leu Ala Asp Arg Asn Arg Arg Glu
225 230 235 240

Gly Ser Gly Asn Pro Thr Pro Leu Ile Asn Pro Leu Ala Gly Arg Ala
245 250 255

Pro Tyr Val Gly Met Val Thr Lys Pro Ala Asn Glu Gln Ser Gln Asp
260 265 270

Phe Ser Ile His Asn Glu Asp Phe Pro Ala Leu Pro Gly Ser Ser Tyr
275 280 285

Lys Asp Pro Thr Ser Ser Asn Asp Asp Ser Lys Ser Asn Leu Asn Thr
290 295 300

Ser Gly Lys Thr Thr Ser Ser Thr Asp Gly Pro Lys Phe Pro Gly Asp
305 310 315 320

Lys Ser Ser Thr Thr Gln Asn Asn Asn Gln Gln Lys Lys Gly Ile Gln
325 330 335

Val Leu Pro Asp Gly Arg Val Thr Asn Ile Pro Gln Gly Met Val Thr
340 345 350

Asp Gln Phe Gly Met Ile Gly Leu Leu Thr Phe Ile Arg Ala Ala Glu
355 360 365

Thr Asp Pro Gly Met Val His Leu Ala Leu Gly Ser Asp Leu Thr Thr
370 375 380

Leu Gly Leu Asn Leu Asn Ser Pro Glu Asn Leu Tyr Pro Lys Phe Ala
385 390 395 400

Ser Pro Trp Ala Ser Ser Pro Cys Arg Pro Gln Asp Ile Asp Phe His
405 410 415

Val Pro Ser Glu Tyr Leu Thr Asn Ile His Ile Arg Asp Lys Leu Ala
420 425 430

Ala Ile Lys Leu Gly Arg Tyr Gly Glu Asp Leu Leu Phe Tyr Leu Tyr
435 440 445

Tyr Met Asn Gly Gly Asp Val Leu Gln Leu Leu Ala Ala Val Glu Leu
450 455 460

Phe Asn Arg Asp Trp Arg Tyr His Lys Glu Glu Arg Val Trp Ile Thr
465 470 475 480

Arg Ala Pro Gly Met Glu Pro Thr Met Lys Thr Asn Thr Tyr Glu Arg
485 490 495

Gly Thr Tyr Tyr Phe Phe Asp Cys Leu Asn Trp Arg Lys Val Ala Lys
500 505 510

Glu Phe His Leu Glu Tyr Asp Lys Leu Glu Glu Arg Pro His Leu Pro
515 520 525

Ser Thr Phe Asn Tyr Asn Pro Ala Gln Gln Ala Phe Xaa
530 535 540



(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 58 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

Met Ile Cys Pro Gln Cys Pro Leu Ser Leu Leu Cys Leu Ile Ser Ser
1 5 10 15

Leu Cys Ser Leu Val Ile Gln Ile Ser Leu Lys Thr Ile Arg Asp Ile
20 25 30

Thr Leu Leu Asn Met Val Gly Ile Lys Phe Ser Ile Ser Leu Ser Asn
35 40 45

Lys Ile Asn Ile Asn Ser Arg Thr Trp Xaa
50 55



(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 202 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:

Met Thr Leu Arg Pro Ser Leu Leu Pro Leu His Leu Leu Leu Leu Leu
1 5 10 15

Leu Leu Ser Ala Ala Val Cys Arg Ala Glu Ala Gly Leu Glu Thr Glu
20 25 30

Ser Pro Val Arg Thr Leu Gln Val Glu Thr Leu Val Glu Pro Pro Glu
35 40 45

Pro Cys Ala Glu Pro Ala Ala Phe Gly Asp Thr Leu His Ile His Tyr
50 55 60

Thr Gly Ser Leu Val Asp Gly Arg Ile Ile Asp Thr Ser Leu Thr Arg
65 70 75 80

Asp Pro Leu Val Ile Glu Leu Gly Gln Lys Gln Val Ile Pro Gly Leu
85 90 95

Glu Gln Ser Leu Leu Asp Met Cys Val Gly Glu Lys Arg Arg Ala Ile
100 105 110

Ile Pro Ser His Leu Ala Tyr Gly Lys Arg Gly Phe Pro Pro Ser Val
115 120 125

Pro Ala Asp Ala Val Val Gln Tyr Asp Val Glu Leu Ile Ala Leu Ile
130 135 140

Arg Ala Asn Tyr Trp Leu Lys Leu Val Lys Gly Ile Leu Pro Leu Val
145 150 155 160

Gly Met Ala Met Val Pro Ala Leu Leu Gly Leu Ile Gly Tyr His Leu
165 170 175

Tyr Arg Lys Ala Asn Arg Pro Lys Val Ser Lys Lys Lys Leu Lys Glu
180 185 190

Glu Lys Arg Asn Lys Ser Lys Lys Lys Xaa
195 200



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 217 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

Met Phe Leu Arg Leu Tyr Leu Ile Ala Arg Val Met Leu Leu His Ser
1 5 10 15

Lys Leu Phe Thr Asp Ala Ser Ser Arg Ser Ile Gly Ala Leu Asn Lys
20 25 30

Ile Asn Phe Asn Thr Arg Phe Val Met Lys Thr Leu Met Thr Ile Cys
35 40 45

Pro Gly Thr Val Leu Leu Val Phe Ser Ile Ser Leu Trp Ile Ile Ala
50 55 60

Ala Trp Thr Val Arg Val Cys Glu Ser Pro Glu Ser Pro Ala Gln Pro
65 70 75 80

Ser Gly Ser Ser Leu Pro Ala Trp Tyr His Asp Gln Gln Asp Val Thr
85 90 95

Ser Asn Phe Leu Gly Ala Met Trp Leu Ile Ser Ile Thr Phe Leu Ser
100 105 110

Ile Gly Tyr Gly Asp Met Val Pro His Thr Tyr Cys Gly Lys Gly Val
115 120 125

Cys Leu Leu Thr Gly Ile Met Gly Ala Gly Cys Thr Ala Leu Val Val
130 135 140

Ala Val Val Ala Arg Lys Leu Glu Leu Thr Lys Ala Glu Lys His Val
145 150 155 160

His Asn Phe Met Met Asp Thr Gln Leu Thr Lys Arg Ile Lys Asn Ala
165 170 175

Ala Ala Asn Val Leu Arg Glu Thr Trp Leu Ile Tyr Lys His Thr Lys
180 185 190

Leu Leu Lys Lys Ile Asp His Ala Lys Val Arg Lys His Gln Arg Lys
195 200 205

Phe Leu Pro Ser Tyr Pro Pro Val Xaa
210 215

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:

Met Ser Asn Thr Thr Val Pro Asn Ala Pro Gln Ala Asn Ser Asp Ser
1 5 10 15

Met Val Gly Tyr Val Leu Gly Pro Phe Phe Leu Ile Thr Leu Val Gly
20 25 30

Val Val Val Ala Val Val Met Tyr Val Gln Lys Lys Lys Arg Val Asp
35 40 45

Arg Leu Arg His His Leu Leu Pro Met Tyr Ser Tyr Asp Pro Ala Glu
50 55 60

Glu Leu His Glu Ala Glu Gln Glu Leu Leu Ser Asp Met Gly Asp Pro
65 70 75 80

Lys Val Val His Gly Trp Gln Ser Gly Tyr Gln His Lys Arg Met Pro
85 90 95

Leu Leu Asp Val Lys Thr
100



(2) INFORMATION FOR SEQ ID NO: 143: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 112 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:

Met Arg Glu Cys Gln Glu Glu Ser Phe Trp Lys Arg Ala Leu Pro Phe
1 5 10 15

Ser Leu Val Ser Met Leu Val Thr Gln Gly Leu Val Tyr Gln Gly Tyr
20 25 30

Leu Ala Ala Asn Ser Arg Phe Gly Ser Leu Pro Lys Val Ala Leu Ala
35 40 45

Gly Leu Leu Gly Phe Gly Leu Gly Lys Val Ser Tyr Ile Gly Val Cys
50 55 60

Gln Ser Lys Phe His Phe Phe Glu Asp Gln Leu Arg Gly Ala Gly Phe
65 70 75 80

Gly Pro Gln His Asn Arg His Cys Leu Leu Thr Cys Glu Glu Cys Lys
85 90 95

Ile Lys His Gly Leu Ser Glu Lys Gly Asp Ser Gln Pro Ser Ala Ser
100 105 110



(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

Met Lys Asn Asp Arg Asn Gln Gly Phe Ser Leu Leu Gln Leu Ile Asp
1 5 10 15

Trp Asn Lys Pro
20



(2) INFORMATION FOR SEQ ID NO: 145: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:

Met Gly Thr Gln Pro Pro Val Val Ala Gly Phe Thr Ile Pro Met Leu
1 5 10 15

Gly Tyr Thr Val Arg Val Leu Thr Phe His Leu Ser Cys Ser
20 25 30



(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 99 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

Met Lys Ile Pro Val Leu Pro Ala Val Val Leu Leu Ser Leu Leu Val
1 5 10 15

Leu His Ser Ala Gln Gly Ala Thr Leu Gly Gly Pro Glu Glu Glu Ser
20 25 30

Thr Ile Glu Asn Tyr Ala Ser Arg Pro Glu Ala Phe Asn Thr Pro Phe
35 40 45

Leu Asn Ile Asp Lys Leu Arg Ser Ala Phe Lys Ala Asp Glu Phe Leu
50 55 60

Asn Trp His Ala Leu Phe Glu Ser Ile Lys Arg Lys Leu Pro Phe Leu
65 70 75 80

Asn Trp Asp Ala Phe Pro Lys Leu Lys Gly Leu Arg Ser Ala Thr Pro
85 90 95

Asp Ala Gln



(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:

Met Val Trp Gly Leu Leu Leu Gly
1 5



(2) INFORMATION FOR SEQ ID NO: 148: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:

Met Leu Pro Leu Leu Ser Leu Leu Phe Leu Phe Phe Ser Thr Val Ser
1 5 10 15

Ser Phe Cys Gly Met Pro Leu Arg Ala His Thr Arg Ala Xaa Ala His
20 25 30

Thr Arg Thr Phe Ala Ser Arg
35

(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:

Met Ile Cys Glu Thr Lys Ala Arg Lys Ser Ser Gly Gln Pro Gly Arg
1 5 10 15

Leu Pro Pro Pro Thr Leu Ala Pro Pro Gln Pro Pro Leu Pro Glu Thr
20 25 30

Ile Glu Arg Pro Val Gly Thr Gly Ala Met Val Ala Arg Ser Ser Asp
35 40 45

Leu Pro Tyr Leu Ile Val Gly Val Val Leu Gly Ser Ile Val Leu Ile
50 55 60

Ile Val Thr Phe Ile Pro Phe Cys Leu Trp Arg Ala Trp Ser Lys Gln
65 70 75 80

Lys His Thr Thr Asp Leu Gly Phe Pro Arg Ser Ala Leu Pro Pro Ser
85 90 95

Cys Pro Tyr Thr Met Val Pro Leu Gly Gly Leu Pro Gly His Gln Ala
100 105 110

Val Asp Ser Pro Thr Ser Val Ala Ser Val Asp Gly Pro Val Leu Met
115 120 125

Gly Ser Thr
130



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150:

Met Gly Ala Pro Ser Leu Thr Met Leu Leu Leu Leu Lys Val Gln Pro
1 5 10 15

Arg Arg Thr Gln Ala Phe Asp Ala His Trp Val Gly Leu Pro Leu Leu
20 25 30

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151:

Met Cys Leu Ile Phe Leu Leu Leu Leu Leu Leu Ser Phe Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO: 152:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152:

His Pro His Gln Asp Ser Gln Pro
1 5



(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 68 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153:

Met Asn Thr Ser Tyr Ile Leu Arg Leu Thr Val Val Val Ser Val Val
1 5 10 15

Ile Tyr Leu Ala Ile His Pro Leu Leu Ser Phe Ser Leu Glu Ser Pro
20 25 30

Leu Leu Val Pro Trp Arg Asp Cys Cys Gln Asn Ile Trp Lys Ser Gly
35 40 45

Ser Val Trp Tyr Lys Arg Trp Thr Leu Pro His Met Glu Val Cys Cys
50 55 60

Gln Asp Leu His
65



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154:

Met Leu Lys Ile Phe Lys Glu Trp Glu Asn Leu Asn Leu Ile Leu Thr
1 5 10 15

Ser Ile Arg Ile Leu Glu Arg Gln Asn Met
20 25



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 195 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:

Met Asp Cys Glu Val Asn Asn Gly Ser Ser Leu Arg Asp Glu Cys Ile
1 5 10 15

Thr Asn Leu Leu Val Phe Gly Phe Leu Gln Ser Cys Ser Asp Asn Ser
20 25 30

Phe Arg Arg Glu Leu Asp Ala Leu Gly His Glu Leu Pro Val Leu Ala
35 40 45

Pro Gln Trp Glu Gly Tyr Asp Glu Leu Gln Thr Asp Gly Asn Arg Ser
50 55 60

Ser His Ser Arg Leu Gly Arg Ile Glu Ala Asp Ser Glu Ser Gln Glu
65 70 75 80

Asp Ile Ile Arg Asn Ile Ala Arg His Leu Ala Gln Val Gly Asp Ser
85 90 95

Met Asp Arg Ser Ile Pro Pro Gly Leu Val Asn Gly Leu Ala Leu Gln
100 105 110

Leu Arg Asn Thr Ser Arg Ser Glu Glu Asp Arg Asn Arg Asp Leu Ala
115 120 125

Thr Ala Leu Glu Gln Leu Leu Gln Ala Tyr Pro Arg Asp Met Glu Lys
130 135 140

Glu Lys Thr Met Leu Val Leu Ala Leu Leu Leu Ala Lys Lys Val Ala
145 150 155 160

Ser His Thr Pro Ser Leu Leu Arg Asp Val Phe His Thr Thr Val Asn
165 170 175

Phe Ile Asn Gln Asn Leu Arg Thr Tyr Val Arg Ser Leu Ala Arg Asn
180 185 190

Gly Met Asp
195



(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 91 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:

Met Ser Leu Ser Leu Val Ser Val Ser Val Gly Pro Ser Thr Leu Ala
1 5 10 15

Cys Ser Phe Leu Arg Pro Lys Ala Arg Pro Ser Lys Arg Ser Pro Arg
20 25 30

Asn Tyr Thr Asp Ser Thr Ser Pro Gly Gly Pro Arg Ala Pro Arg Gly
35 40 45

Gly Ala Trp Arg Leu Ser Ser Gln Gln Asn Ser Ser Pro Lys Gly Val
50 55 60

Ala Val Ala Lys Ala Ser Tyr Arg Pro Val Leu Cys Phe Leu Pro Gly
65 70 75 80

Pro Trp Ser Ser Xaa Pro Xaa Ala Phe Leu Ile
85 90

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:

Met Gly Thr Leu Ser Ala Glu Cys Ser Gly Pro Ala Thr Leu Gly Leu
1 5 10 15

Cys Leu Val Val Pro Trp Asn Ser Ser Gly Leu Ser Gln Pro Pro
20 25 30



(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 91 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158:

Met Lys Phe Leu Ala Val Leu Val Leu Leu Gly Val Ser Ile Phe Leu
1 5 10 15

Val Ser Ala Gln Asn Pro Thr Thr Ala Ala Pro Ala Asp Thr Tyr Pro
20 25 30

Ala Thr Gly Pro Ala Asp Asp Glu Ala Pro Asp Ala Glu Thr Thr Ala
35 40 45

Ala Ala Thr Thr Ala Thr Thr Ala Ala Pro Thr Thr Ala Thr Thr Ala
50 55 60

Ala Ser Thr Thr Ala Arg Lys Asp Ile Pro Val Leu Pro Lys Trp Val
65 70 75 80

Gly Asp Leu Pro Asn Gly Arg Val Cys Pro Xaa
85 90



(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 89 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

Met Ile Ile Ser Leu Phe Ile Tyr Ile Phe Leu Thr Cys Ser Asn Thr
1 5 10 15

Ser Pro Ser Tyr Gln Gly Thr Gln Leu Gly Leu Gly Leu Pro Ser Ala
20 25 30

Gln Trp Trp Pro Leu Thr Gly Arg Arg Met Gln Cys Cys Arg Leu Phe
35 40 45

Cys Phe Leu Leu Gln Asn Cys Leu Phe Pro Phe Pro Leu His Leu Ile
50 55 60

Gln His Asp Pro Cys Glu Leu Val Leu Thr Ile Ser Trp Asp Trp Ala
65 70 75 80

Glu Ala Gly Ala Ser Leu Tyr Ser Pro
85



(2) INFORMATION FOR SEQ ID NO: 160:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 174 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:

Met Ser Ser Ala Ala Ala Asp His Trp Ala Trp Leu Leu Val Leu Ser
1 5 10 15

Phe Val Phe Gly Cys Asn Val Leu Arg Ile Leu Leu Pro Ser Phe Ser
20 25 30

Ser Phe Met Ser Arg Val Leu Gln Lys Asp Ala Glu Gln Glu Ser Gln
35 40 45

Met Arg Ala Glu Ile Gln Asp Met Lys Gln Glu Leu Ser Thr Val Asn
50 55 60

Met Met Asp Glu Phe Ala Arg Tyr Ala Arg Leu Glu Arg Lys Ile Asn
65 70 75 80

Lys Met Thr Asp Lys Leu Lys Thr His Val Lys Ala Arg Thr Ala Gln
85 90 95

Leu Ala Lys Ile Lys Trp Val Ile Ser Val Ala Phe Tyr Val Leu Gln
100 105 110

Ala Ala Leu Met Ile Ser Leu Ile Trp Lys Tyr Tyr Ser Val Pro Val
115 120 125

Ala Val Val Pro Ser Lys Trp Ile Thr Pro Leu Asp Arg Leu Val Ala
130 135 140

Phe Pro Thr Arg Val Ala Gly Gly Val Gly Ile Thr Cys Trp Ile Leu
145 150 155 160

Val Cys Asn Lys Val Val Ala Ile Val Leu His Pro Phe Ser
165 170

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:

Met Gly Lys Leu Ile Asn Ile Val Ile Arg Lys Pro Leu Leu Leu Leu
1 5 10 15

Leu Val Gln Cys Glu Asn Cys Cys Arg Lys Asn Met Leu Tyr Asn Ile
20 25 30

Phe Leu Asn Ile His Asn Ile His Lys Phe Ser Asn His
35 40 45



(2) INFORMATION FOR SEQ ID NO: 162:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:

Met Val Ala Ser Thr Leu Val Thr Asn Leu Phe Gly Val Ala Phe Ala
1 5 10 15

Thr Thr Ala Ala Thr Arg Ala
20



(2) INFORMATION FOR SEQ ID NO: 163: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:

Met Leu Met Ala Pro Val Val Cys Leu Ser Phe Ser Pro Cys Pro Ala
1 5 10 15

Asp Thr Ser Leu Thr Gly Asp Gly Leu Lys Ala Gly Leu Glu Arg Gly
20 25 30

Xaa Ala Leu Val Thr Leu Phe Asp Ser Val Thr His Phe Leu Ala His
35 40 45

Thr Leu Phe Glu Leu Leu Asp Phe Gln Leu Ala Phe Leu Arg Ser Gly
50 55 60

Lys Gln Thr Ala Pro His
65 70



(2) INFORMATION FOR SEQ ID NO: 164: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

Met Leu Leu Leu Leu Leu Leu Leu Gly Ser Gly Gln Gly Pro Gln Gln
1 5 10 15

Val Gly Ala Gly Gln Thr Phe Glu Tyr Leu Lys Arg Glu His Ser Leu
20 25 30

Ser Lys Pro Tyr Gln Gly Val Gly Thr Gly Ser Ser Ser Leu Trp Asn
35 40 45

Leu Met Gly Asn Ala Met Val Met Thr Gln Tyr Ile Arg Leu Thr Pro
50 55 60

Asp Met Gln Ser Lys Gln Gly Ala Leu Trp Asn Arg Val Pro Cys Phe
65 70 75 80

Leu Arg Asp Trp Glu Leu Gln Val His Phe Lys Ile His Gly Gln Gly
85 90 95

Lys Lys Asn Leu His Gly Asp Gly Leu Ala Ile Trp Tyr Thr Arg Asn
100 105 110

Arg Met Gln Pro Gly Pro Val Phe Gly Asn Met Asp Lys Phe Val Gly
115 120 125



Leu Gly Val Phe Val Asp Thr Tyr Pro Asn Glu Glu Lys Gln Gln Glu
130 135 140

Arg Val Phe Pro Tyr Ile Ser Ala Met Val Asn Asn Gly Ser Leu Ser
145 150 155 160

Tyr Asp His Glu Arg Asp Gly Arg Pro Thr Glu Leu Gly Gly Cys Thr
165 170 175

Ala Ile Val Arg Asn Leu His Tyr Asp Thr Phe Leu Val Ile Arg Tyr
180 185 190

Val Lys Arg His Leu Thr Ile Met Met Asp Ile Asp Gly Lys His Glu
195 200 205

Trp Arg Asp Cys Ile Glu Val Pro Gly Val Arg Leu Pro Arg Gly Tyr
210 215 220

Tyr Phe Gly Thr Ser Ser Ile Thr Gly Asp Leu Ser Asp Asn His Asp
225 230 235 240

Val Ile Ser Leu Lys Leu Phe Glu Leu Thr Val Glu Arg Thr Pro Glu
245 250 255

Glu Glu Lys Leu His Arg Asp Val Phe Leu Pro Ser Val Asp Asn Met
260 265 270

Lys Leu Pro Glu Met Thr Ala Pro Leu Pro Pro Leu Ser Gly Leu Ala
275 280 285

Leu Phe Leu Ile Val Phe Phe Ser Leu Val Phe Ser Val Phe Ala Ile
290 295 300

Val Ile Gly Ile Ile Leu Tyr Asn Lys Trp Gln Glu Gln Ser Arg Lys
305 310 315 320

Arg Phe Tyr



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 321 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:

Met Pro Ser Glu Tyr Thr Tyr Val Lys Leu Arg Ser Asp Cys Ser Arg
1 5 10 15

Pro Ser Leu Gln Trp Tyr Thr Arg Ala Gln Ser Lys Met Arg Arg Pro
20 25 30

Ser Leu Leu Leu Lys Asp Ile Leu Lys Cys Thr Leu Leu Val Phe Gly
35 40 45

Val Trp Ile Leu Tyr Ile Leu Lys Leu Asn Tyr Thr Thr Glu Glu Cys
50 55 60

Asp Met Lys Lys Met His Tyr Val Asp Pro Asp His Val Lys Arg Ala
65 70 75 80

Gln Lys Tyr Ala Gln Gln Val Leu Gln Lys Glu Cys Arg Pro Lys Phe
85 90 95

Ala Lys Thr Ser Met Ala Leu Leu Phe Glu His Arg Tyr Ser Val Asp
100 105 110

Leu Leu Pro Phe Val Gln Lys Xaa Pro Lys Asp Ser Glu Ala Glu Ser
115 120 125

Lys Tyr Asp Pro Pro Phe Gly Phe Arg Lys Phe Ser Ser Lys Val Gln
130 135 140

Thr Leu Leu Glu Leu Leu Pro Glu His Asp Leu Pro Glu His Leu Lys
145 150 155 160

Ala Lys Thr Cys Arg Arg Cys Val Val Ile Gly Ser Gly Gly Ile Leu
165 170 175

His Gly Leu Glu Leu Gly His Thr Leu Asn Gln Phe Asp Val Val Ile
180 185 190

Arg Leu Asn Ser Ala Pro Val Glu Gly Tyr Ser Glu His Val Gly Asn
195 200 205

Lys Thr Thr Ile Arg Met Thr Tyr Pro Glu Gly Ala Pro Leu Ser Asp
210 215 220

Leu Glu Tyr Tyr Ser Asn Asp Leu Phe Val Ala Val Leu Phe Lys Ser
225 230 235 240

Val Asp Phe Asn Trp Leu Gln Ala Met Val Lys Lys Glu Thr Leu Pro
245 250 255

Phe Trp Val Arg Leu Phe Phe Trp Lys Gln Val Ala Glu Lys Ile Pro
260 265 270

Leu Gln Pro Lys His Phe Arg Ile Leu Asn Pro Val Ile Ile Lys Glu
275 280 285



WO 98/42738 



PCT/US98/05311 






Thr Ala Phe Xaa His Pro Ser Val Leu Arg Ala Ser Val Lys Val Leu 
290 295 300 

Gly Ala Glu Ile Arg Thr Ser Pro Gln Ser Val Ser Leu Pro Leu Ser
305 310 315 320

Xaa

(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

Met Thr Leu Asp Val Gln Thr Val Val Val Phe Ala Val Ile Val Val
1 5 10 15

Leu Leu Leu Val Asn Val Ile Leu Met Phe Phe Leu Gly Thr Arg
20 25 30



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 72 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:

Met Leu Pro Leu Leu Phe Cys Ala Phe Cys Leu His Lys Leu Gly Pro
1 5 10 15

Leu Leu Phe Leu Tyr Asp Val Leu Met Xaa His Glu Ala Val Met Arg
20 25 30

Thr His Gln Ile Gln Leu Pro Asp Pro Glu Phe Pro Ser Gln Gln Asn
35 40 45

Gln Val Leu Asn Lys Thr Leu Phe Asn Lys Leu Lys Lys Lys Lys Lys
50 55 60

Lys Lys Lys Xaa Xaa Xaa Lys Lys
65 70



(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 282 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

Met Ala Ser Arg Gly Arg Arg Pro Glu His Gly Gly Pro Pro Glu Leu
1 5 10 15

Phe Tyr Asp Glu Thr Glu Ala Arg Lys Tyr Val Arg Asn Ser Arg Met
20 25 30

Ile Asp Ile Gln Thr Arg Met Ala Gly Arg Ala Leu Glu Leu Leu Tyr
35 40 45

Leu Pro Glu Asn Lys Pro Cys Tyr Leu Leu Asp Ile Gly Cys Gly Thr
50 55 60

Gly Leu Ser Gly Ser Tyr Leu Ser Asp Glu Gly His Tyr Trp Val Gly
65 70 75 80

Leu Asp Ile Ser Pro Ala Met Leu Asp Glu Ala Val Asp Arg Glu Ile
85 90 95

Glu Gly Asp Leu Leu Leu Gly Asp Met Gly Gln Gly Ile Pro Phe Lys
100 105 110

Pro Gly Thr Phe Asp Gly Cys Ile Ser Ile Ser Ala Val Gln Trp Leu
115 120 125

Cys Asn Ala Asn Lys Lys Ser Glu Asn Pro Ala Lys Arg Leu Tyr Cys
130 135 140

Phe Phe Ala Ser Leu Phe Ser Val Leu Val Arg Gly Ser Arg Ala Val
145 150 155 160

Leu Gln Leu Tyr Pro Glu Asn Ser Glu Gln Leu Glu Leu Ile Thr Thr
165 170 175

Gln Ala Thr Lys Ala Gly Phe Ser Gly Gly Met Val Val Asp Tyr Pro
180 185 190

Asn Ser Ala Lys Ala Lys Lys Phe Tyr Leu Cys Leu Phe Ser Gly Pro
195 200 205

Ser Thr Phe Ile Pro Glu Gly Leu Ser Glu Asn Gln Asp Glu Val Glu
210 215 220

Pro Arg Glu Ser Val Phe Thr Asn Glu Arg Phe Pro Leu Arg Met Ser
225 230 235 240

Arg Arg Gly Met Val Arg Lys Ser Arg Ala Trp Val Leu Glu Lys Lys
245 250 255

Glu Arg His Arg Arg Gln Gly Arg Glu Val Arg Pro Asp Thr Gln Tyr
260 265 270

Thr Gly Arg Lys Arg Lys Pro Arg Phe Xaa
275 280

(2) INFORMATION FOR SEQ ID NO: 169: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:

Met Leu Gly Lys Thr Lys Phe Gln Ser Tyr Lys Ser Phe Ser Arg Lys
1 5 10 15

Leu Met Val Cys Pro Ser Thr
20



(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 328 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

Met Trp Arg Pro Ser Val Leu Leu Leu Leu Leu Leu Leu Arg His Gly
1 5 10 15

Ala Gln Gly Lys Pro Ser Pro Asp Ala Gly Pro His Gly Gln Gly Arg
20 25 30

Val His Gln Ala Ala Pro Leu Ser Asp Ala Pro His Asp Asp Ala His
35 40 45

Gly Asn Phe Gln Tyr Asp His Glu Ala Phe Leu Gly Arg Glu Val Ala
50 55 60

Lys Glu Phe Asp Gln Leu Thr Pro Glu Glu Ser Gln Ala Arg Leu Gly
65 70 75 80

Arg Ile Val Asp Arg Met Asp Arg Ala Gly Asp Gly Asp Gly Trp Val
85 90 95

Ser Leu Ala Glu Leu Arg Ala Trp Ile Ala His Thr Gln Gln Arg His
100 105 110

Ile Arg Asp Ser Val Ser Ala Ala Trp Asp Thr Tyr Asp Thr Asp Arg
115 120 125

Asp Gly Arg Val Gly Trp Glu Glu Leu Arg Asn Ala Thr Tyr Gly His
130 135 140

Tyr Ala Pro Gly Glu Glu Phe His Asp Val Glu Asp Ala Glu Thr Tyr
145 150 155 160

Lys Lys Met Leu Ala Arg Asp Glu Arg Arg Phe Arg Val Ala Asp Gln
165 170 175

Asp Gly Asp Ser Met Ala Thr Arg Glu Glu Leu Thr Ala Phe Leu His
180 185 190

Pro Glu Glu Phe Pro His Met Arg Asp Ile Val Ile Ala Glu Thr Leu
195 200 205



Glu Asp Leu Asp Arg Asn Lys Asp Gly Tyr Val Gln Val Glu Glu Tyr
210 215 220

Ile Ala Asp Leu Tyr Ser Ala Glu Pro Gly Glu Glu Glu Pro Ala Trp
225 230 235 240

Val Gln Thr Glu Arg Gln Gln Phe Arg Asp Phe Arg Asp Leu Asn Lys
245 250 255

Asp Gly His Leu Asp Gly Ser Glu Val Gly His Trp Val Leu Pro Pro
260 265 270

Ala Gln Asp Gln Pro Leu Val Glu Ala Asn His Leu Leu His Glu Ser
275 280 285

Asp Thr Asp Lys Asp Gly Arg Leu Ser Lys Ala Xaa Ile Leu Gly Asn
290 295 300

Trp Asn Met Phe Val Gly Ser Gln Ala Thr Asn Tyr Gly Glu Asp Leu
305 310 315 320

Thr Arg His His Asp Glu Leu Xaa
325

(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:

Met Cys Trp Leu Arg Ala Trp Xaa Gln Ile Xaa Leu Pro Val Phe Xaa
1 5 10 15

Ser Xaa Phe Leu Ile Gln Leu Leu Ile Ser Phe Ser Glu Asn Gly Phe
20 25 30

Ile His Ser Pro Arg Asn Asn Gln Lys Pro Arg Asp Gly Asn Xaa Glu
35 40 45

Glu Cys Ala Val Lys Lys Ser Cys Gln Leu Cys Thr Glu Asp Lys Lys
50 55 60

Tyr Met Met Asn Arg
65

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

Met Trp Leu Phe Ile Leu Leu Ser Leu Ala Leu Ile Ser Asp Ala Met
1 5 10 15

Val Met Asp Glu Lys Val Lys Arg Ser Phe Val Leu Asp Thr Ala Ser
20 25 30

Ala Ile Cys Asn Tyr Asn Ala His Tyr Lys Asn His Pro Lys Tyr Trp
35 40 45

Cys Arg Gly Tyr Phe Arg Asp Tyr Cys Asn Ile Ile Ala Phe Ser Pro
50 55 60

Asn Ser Thr Asn His Val Ala Leu Lys Asp Thr Gly Asn Gln Leu Ile
65 70 75 80

Val Thr Met Ser Cys Leu Asn Lys Glu Asp Thr Gly Trp Tyr Trp Cys
85 90 95

Gly Ile Gln Arg Asp Phe Ala Arg Asp Asp Met Asp Phe Thr Glu Leu
100 105 110

Ile Val Thr Asp Asp Lys Gly Thr Trp Pro Met Thr Leu Val Trp Glu
115 120 125

Arg Leu Ser Gly Thr Lys Pro Glu Ala Ala Arg Leu Pro Lys Leu Ser
130 135 140

Ala Arg Leu Thr Ala Pro Gly Arg Pro Phe Ser Ser Phe Ala Tyr Xaa
145 150 155 160



(2) INFORMATION FOR SEQ ID NO: 173: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173:

Met Ala Xaa His Phe Leu Leu Val Ala Leu Gln Ser Val Pro His Cys
1 5 10 15

Pro His Leu Leu Glu Glu Glu His Lys Leu Cys Lys Val Ser His Phe
20 25 30

Ser Gly Val Thr Leu Val Thr Ser Arg Gln Asp Ser Ser Ser Tyr Val
35 40 45

Pro Val Gln Thr Leu Phe Ile His Leu Gly Pro Trp Ala Trp Asp Leu
50 55 60

Xaa Pro Cys Thr Ala Glu Asp Pro Glu Ala Glu Arg Ser Leu Arg Leu
65 70 75 80

Cys His Ser His Leu Ala Arg Xaa Asn Val Ser Pro Ser Gln Ala Ala
85 90 95

Glu Gly Xaa Xaa Xaa Arg Gly Cys Gln His Arg Gly Ser Arg Glu Leu
100 105 110

Thr Phe Leu Ser Ala Glu Asn Glu Ala Gly Ile
115 120



(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 129 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:

Met Lys Val Gly Ala Arg Ile Arg Val Lys Met Ser Val Asn Lys Ala
1 5 10 15

His Pro Val Val Ser Thr His Trp Arg Trp Pro Ala Glu Trp Pro Gln
20 25 30

Met Phe Leu His Leu Ala Gln Glu Pro Arg Thr Glu Val Lys Ser Arg
35 40 45

Pro Leu Gly Leu Ala Gly Phe Ile Arg Gln Asp Ser Lys Thr Arg Lys
50 55 60

Pro Leu Glu Gln Glu Thr Ile Met Ser Ala Ala Asp Thr Ala Leu Trp
65 70 75 80

Pro Tyr Gly His Gly Asn Arg Glu His Gln Glu Asn Glu Leu Gln Lys
85 90 95

Tyr Leu Gln Tyr Lys Asp Met His Leu Leu Asp Ser Gly Gln Ser Leu
100 105 110

Gly His Thr His Thr Leu Gln Gly Ser His Asn Leu Thr Ala Leu Asn
115 120 125

Ile



(2) INFORMATION FOR SEQ ID NO: 175:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 372 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:

Met Ala Tyr His Ser Phe Leu Val Glu Pro Ile Ser Cys His Ala Trp
1 5 10 15

Asn Lys Asp Arg Thr Gln Ile Ala Ile Cys Pro Asn Asn His Glu Val
20 25 30

His Ile Tyr Glu Lys Ser Gly Ala Lys Trp Thr Lys Val His Glu Leu
35 40 45

Lys Glu His Asn Gly Gln Val Thr Gly Ile Asp Trp Ala Pro Glu Ser
50 55 60

Asn Arg Ile Val Thr Cys Gly Thr Asp Arg Asn Ala Tyr Val Trp Thr
65 70 75 80

Leu Lys Gly Arg Thr Trp Lys Pro Thr Leu Val Ile Leu Arg Ile Asn
85 90 95

Arg Ala Ala Arg Cys Val Arg Trp Ala Pro Asn Glu Asn Lys Phe Ala
100 105 110

Val Gly Ser Gly Ser Arg Val Ile Ser Ile Cys Tyr Phe Glu Gln Glu
115 120 125

Asn Asp Trp Trp Val Cys Lys His Ile Lys Lys Pro Ile Arg Ser Thr
130 135 140

Val Leu Ser Leu Asp Trp His Pro Asn Asn Val Leu Leu Ala Ala Gly
145 150 155 160

Ser Cys Asp Phe Lys Cys Arg Ile Phe Ser Ala Tyr Ile Lys Glu Val
165 170 175

Glu Glu Arg Pro Ala Pro Thr Pro Trp Gly Ser Lys Met Pro Phe Gly
180 185 190

Glu Leu Met Phe Glu Ser Ser Ser Ser Cys Gly Trp Val His Gly Val
195 200 205

Cys Phe Ser Ala Ser Gly Ser Arg Val Ala Trp Val Ser His Asp Ser
210 215 220

Thr Val Cys Leu Ala Asp Ala Asp Lys Lys Met Ala Val Ala Thr Leu
225 230 235 240



Ala Ser Glu Thr Leu Pro Leu Leu Ala Leu Thr Phe Ile Thr Asp Asn
245 250 255

Ser Leu Val Ala Ala Gly His Asp Cys Phe Pro Val Leu Phe Thr Tyr
260 265 270

Asp Ala Ala Ala Gly Met Leu Ser Phe Gly Gly Arg Leu Asp Val Pro
275 280 285

Lys Gln Ser Ser Gln Arg Gly Leu Thr Ala Arg Glu Arg Phe Gln Asn
290 295 300

Leu Asp Lys Lys Ala Ser Ser Glu Gly Gly Thr Ala Ala Gly Ala Gly
305 310 315 320

Leu Asp Ser Leu His Lys Asn Ser Val Ser Gln Ile Ser Val Leu Ser
325 330 335

Gly Gly Lys Ala Lys Cys Ser Gln Phe Cys Thr Thr Gly Met Asp Gly
340 345 350

Gly Met Ser Ile Trp Asp Val Lys Ser Leu Glu Ser Ala Leu Lys Asp
355 360 365

Leu Lys Ile Lys
370



(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 216 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:

Met Trp Ser Ile Gly Ala Gly Ala Leu Gly Ala Ala Ala Leu Ala Leu
1 5 10 15

Leu Leu Ala Asn Thr Asp Val Phe Leu Ser Lys Pro Gln Lys Ala Ala
20 25 30

Leu Glu Tyr Leu Glu Asp Ile Asp Leu Lys Thr Leu Glu Lys Glu Pro
35 40 45

Arg Thr Phe Lys Ala Lys Glu Leu Trp Glu Lys Asn Gly Ala Val Ile
50 55 60

Met Ala Val Arg Arg Pro Gly Cys Phe Leu Cys Arg Glu Glu Ala Ala
65 70 75 80

Asp Leu Ser Ser Leu Lys Ser Met Leu Asp Gln Leu Gly Val Pro Leu
85 90 95

Tyr Ala Val Val Lys Glu His Ile Arg Thr Glu Val Lys Asp Phe Gln
100 105 110

Pro Tyr Phe Lys Gly Glu Ile Phe Leu Asp Glu Lys Lys Lys Phe Tyr
115 120 125

Gly Pro Gln Arg Arg Lys Met Met Phe Met Gly Phe Ile Arg Leu Gly
130 135 140

Val Trp Tyr Asn Phe Phe Arg Ala Trp Asn Gly Gly Phe Ser Gly Asn
145 150 155 160

Leu Glu Gly Glu Gly Phe Ile Leu Gly Gly Val Phe Val Val Gly Ser
165 170 175

Gly Lys Gln Gly Ile Leu Leu Glu His Arg Glu Lys Glu Phe Gly Asp
180 185 190

Lys Val Asn Leu Leu Ser Val Leu Glu Ala Ala Lys Met Ile Lys Pro
195 200 205

Gln Thr Leu Ala Ser Glu Lys Lys
210 215



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 55 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:

Met Lys Pro Val Ser Arg Arg Thr Leu Asp Trp Ile Tyr Ser Val Leu
1 5 10 15

Leu Leu Ala Ile Val Leu Ile Ser Trp Gly Cys Ile Ile Tyr Ala Ser
20 25 30

Met Val Ser Ala Arg Arg Gln Leu Arg Lys Lys Tyr Pro Asp Lys Ile
35 40 45

Phe Gly Thr Asn Glu Asn Leu
50 55

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:

Met Ala Ala Asn Thr Phe Val Leu Ile Met Gly Ile Pro Thr Ser Ala
1 5 10 15

Asn Ala Xaa Arg Asp Leu Phe
20

(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 103 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:

Met Ser Ile Cys His Arg Gly Thr Gly Ile Ala Leu Ser Ala Gly Val
1 5 10 15

Ser Leu Phe Gly Met Ser Ala Leu Leu Leu Pro Gly Asn Phe Glu Ser
20 25 30

Tyr Leu Glu Leu Val Lys Ser Leu Cys Leu Gly Pro Ala Leu Ile His
35 40 45

Thr Ala Lys Phe Ala Leu Val Phe Pro Leu Met Tyr His Thr Trp Asn
50 55 60

Gly Ile Arg His Leu Met Trp Asp Leu Gly Lys Gly Leu Lys Ile Pro
65 70 75 80

Gln Leu Tyr Gln Ser Gly Val Val Val Leu Val Leu Thr Val Leu Ser
85 90 95

Ser Met Gly Leu Ala Ala Met
100



(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:

Met Thr Lys Ala Ser Ser Leu Trp Pro Leu Lys Thr Thr Cys Gln Ile
1 5 10 15

Ser Gly Thr Val Phe Phe Phe Leu Phe Leu Phe Ser Cys Phe Leu Met
20 25 30

Gln Ala Gln Cys Asp Lys Phe Val Gly Trp Asp Phe Phe Phe Phe Leu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 181:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:

Met Arg Arg Ala Leu Ile Pro Pro Cys Arg Gly Gly Pro Ser Ala Ser
1 5 10 15

Asp Xaa Cys Cys Ser Cys Ser Pro Ser Gly Phe Ser Ala Gly Arg Gly
20 25 30

Arg Cys Pro Val Gln Gly Cys Leu Arg Pro His Arg Val Gln Leu Leu
35 40 45

Arg Arg Trp Gly Pro Gly Ser Pro Ala Gly Gln Arg Leu Ser Lys Gly
50 55 60

Phe Gln Leu Leu Arg Trp Trp Gly Pro Gly Ser Pro Ala Pro Glu Pro
65 70 75 80

Arg Lys Gly Pro Phe Pro Pro Pro Asp Pro Pro Trp Pro Val Thr Leu
85 90 95



60 
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(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182:

Met Leu Glu Thr Thr Lys His Val Gln Ile Ala Cys Met Leu Leu Leu
1 5 10 15

Thr Cys Gln Ile Phe Leu Pro Ser Ser Leu Ser Pro Ser Phe Ile His
20 25 30

Ser Leu Thr Asp Ser Phe Ile Pro Leu Lys Lys Leu Tyr Val Cys Phe
35 40 45

Val Gln Ser Thr Leu Leu Lys Ala Ala Gly Tyr Lys Ser Ile Ser Glu
50 55 60

Ala Leu Gly Phe Asp Xaa Leu Leu Cys Ser Ser Ala Arg Phe Val Trp
65 70 75 80

Ile Cys His Thr Tyr Ser Arg Pro Leu Val Thr Cys Ala Leu His
85 90 95

(2) INFORMATION FOR SEQ ID NO: 183: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:

Met Ser Val Ile Gly Gly Leu Leu Leu Val Val Ala Leu Gly Pro Gly
1 5 10 15

Gly Val Ser Met Asp Glu Lys Lys Lys Glu Trp
20 25



(2) INFORMATION FOR SEQ ID NO: 184:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

Met Ser Gly Gly Leu Ser Phe Leu Leu Leu Val
1 5 10

(2) INFORMATION FOR SEQ ID NO: 185: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 65 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:

Met Phe Ala Asp Phe Ile Val Val Thr Ala Thr Val Gln Arg Cys Pro
1 5 10 15

Gly Ser Pro Pro Leu Ser Glu Ile Leu Trp Lys Asp Glu Pro Phe Ala
20 25 30

Ile Ser Ser His Ala Gly Leu Pro Trp Leu Ser Ser Trp Pro Ala Pro
35 40 45

Pro Trp Thr Trp Ser Trp Ile Ser Arg Arg Arg Glu His Gly Arg Gly
50 55 60

Ser
65

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:

Met Val Glu Ser Val Met Pro Val Val Val Cys Thr Leu Ser Pro Gly
1 5 10 15

Ile Asp Ser Ser Pro Ser
20



(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 132 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:

Met Asp Val Leu Phe Val Ala Ile Phe Ala Val Pro Leu Ile Leu Gly
1 5 10 15

Gln Glu Tyr Glu Asp Glu Glu Arg Leu Gly Glu Asp Glu Tyr Tyr Gln
20 25 30

Val Val Tyr Tyr Tyr Thr Val Thr Pro Ser Tyr Asp Asp Phe Ser Ala
35 40 45

Asp Phe Thr Ile Asp Tyr Ser Ile Phe Glu Ser Glu Asp Arg Leu Asn
50 55 60

Arg Leu Asp Lys Asp Ile Thr Glu Ala Ile Glu Thr Thr Ile Ser Leu
65 70 75 80

Glu Thr Ala Arg Ala Asp His Pro Lys Pro Val Thr Val Lys Pro Val
85 90 95

Thr Thr Glu Pro Gln Ser Pro Asp Leu Asn Asp Ala Val Ser Ser Leu
100 105 110

Arg Ser Pro Ile Pro Leu Leu Leu Ser Cys Ala Phe Val Gln Val Gly
115 120 125

Met Tyr Phe Met
130

(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:

Met Pro Cys Gln Pro Gly Gln Val Pro Ser Cys Gln Cys Thr Phe Gly
1 5 10 15

Leu Leu Leu Met Leu Pro Ser Leu Pro Ser Pro Ala Ser Gln Pro Arg
20 25 30

Pro Phe Cys Ser Ser Met Glu Tyr Phe His Gly Cys Ala Ser Pro Ser
35 40 45

Gln Ala Ile Ile Gly Gly Phe Pro Phe Ala Ser Val Ala Leu Ala Asp
50 55 60

Ile Leu Cys Leu Gln
65

(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:

Met Ser Leu Leu Ser Pro Ala Ile Pro Ala Leu Thr Leu Ile Phe Ile
1 5 10 15

Leu Met Phe Phe Ser Phe Pro Phe Arg Ala His Thr Val Val Thr Ile
20 25 30

Val Ala Ser Gly Phe Leu Gly Leu Ser Pro Leu Cys Gly
35 40 45



(2) INFORMATION FOR SEQ ID NO: 190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 65 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

Met Ala Phe Gly Leu Gln Met Phe Ile Gln Arg Lys Phe Pro Tyr Pro
1 5 10 15

Leu Gln Trp Ser Leu Leu Val Ala Val Val Ala Gly Ser Val Val Ser
20 25 30

Tyr Gly Val Thr Arg Val Glu Ser Glu Lys Cys Asn Asn Leu Trp Leu
35 40 45

Phe Leu Glu Thr Gly Gln Leu Pro Lys Asp Arg Ser Thr Asp Gln Arg
50 55 60

Ser
65



(2) INFORMATION FOR SEQ ID NO: 191:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191:

Met Asn Leu Leu Gly Met Ile Phe Ser Met Cys Gly Leu Met Leu Lys
1 5 10 15

Leu Lys Trp Cys Ala Trp Val Ala Val Tyr Cys Ser Phe Ile Ser Phe
20 25 30

Ala Asn Ser Arg Ser Ser Glu Asp Thr Lys Gln Met Met Ser Ser Phe
35 40 45

Met Xaa
50

(2) INFORMATION FOR SEQ ID NO: 192:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 170 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:

Met Leu Leu Asn Val Ala Leu Val Ala Leu Val Leu Leu Gly Ala Tyr
1 5 10 15

Arg Leu Trp Val Arg Trp Gly Arg Arg Gly Leu Gly Ala Gly Ala Gly
20 25 30

Ala Gly Glu Glu Ser Pro Ala Thr Ser Leu Pro Arg Met Lys Lys Arg
35 40 45

Asp Phe Ser Leu Glu Gln Leu Arg Gln Tyr Asp Gly Ser Arg Asn Pro
50 55 60

Arg Ile Leu Leu Ala Val Asn Gly Lys Val Phe Asp Val Thr Lys Gly
65 70 75 80

Ser Lys Phe Tyr Gly Pro Ala Gly Pro Tyr Gly Ile Phe Ala Gly Arg
85 90 95

Asp Ala Ser Arg Gly Leu Ala Thr Phe Cys Leu Asp Lys Asp Ala Leu
100 105 110

Arg Asp Glu Tyr Asp Asp Leu Ser Asp Leu Asn Ala Val Gln Met Glu
115 120 125

Ser Val Arg Glu Trp Glu Met Gln Phe Lys Glu Lys Tyr Asp Tyr Val
130 135 140

Gly Arg Leu Leu Lys Pro Gly Glu Glu Pro Ser Glu Tyr Thr Asp Glu
145 150 155 160

Glu Asp Thr Lys Asp His Asn Lys Gln Asp
165 170



(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 66 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193:

Met Thr Tyr Phe Ser Gly Leu Leu Val Ile Leu Ala Phe Ala Ala Trp
1 5 10 15

Val Ala Leu Ala Glu Gly Leu Gly Val Ala Val Tyr Ala Ala Ala Val
20 25 30

Leu Leu Gly Ala Gly Cys Ala Thr Ile Leu Val Thr Ser Leu Ala Met
35 40 45

Thr Ala Asp Leu Ile Gly Pro His Thr Asn Ser Gly Leu Ser Cys Thr
50 55 60

Ala Pro
65



(2) INFORMATION FOR SEQ ID NO: 194:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 92 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194:

Met Ala Ala Gly Pro Ser Gly Cys Leu Val Pro Ala Phe Gly Leu Arg
1 5 10 15

Leu Leu Leu Ala Thr Val Leu Gln Ala Val Ser Ala Phe Gly Ala Glu
20 25 30

Phe Ser Ser Glu Ala Cys Arg Glu Leu Gly Phe Ser Ser Asn Leu Leu
35 40 45

Cys Ser Ser Cys Asp Leu Leu Gly Gln Phe Asn Leu Leu Gln Leu Asp
50 55 60

Pro Asp Cys Arg Gly Cys Cys Gln Glu Glu Ala Gln Phe Glu Thr Lys
65 70 75 80

Lys Leu Tyr Ala Gly Ala Ile Leu Glu Val Cys Gly
85 90

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 176 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:

Met Arg Gly Ser His Leu Arg Leu Leu Pro Tyr Leu Val Ala Ala Asn
1 5 10 15

Pro Val Asn Tyr Gly Arg Pro Tyr Arg Leu Ser Cys Val Glu Ala Phe
20 25 30

Ala Ala Thr Phe Cys Ile Val Gly Phe Pro Asp Leu Ala Val Ile Leu
35 40 45

Leu Arg Lys Phe Lys Trp Gly Lys Gly Phe Leu Asp Leu Asn Arg Gln
50 55 60

Leu Leu Asp Lys Tyr Ala Ala Cys Gly Ser Pro Glu Glu Val Leu Gln
65 70 75 80

Ala Glu Gln Glu Phe Leu Ala Asn Ala Lys Glu Ser Pro Gln Glu Glu
85 90 95

Glu Ile Asp Pro Phe Asp Val Asp Ser Gly Arg Glu Phe Gly Asn Pro
100 105 110

Asn Arg Pro Val Ala Ser Thr Arg Leu Pro Ser Asp Thr Asp Asp Ser
115 120 125

Asp Ala Ser Glu Asp Pro Gly Pro Xaa Ala Glu Arg Gly Gly Ala Ser
130 135 140

Ser Ser Cys Cys Glu Glu Glu Gln Thr Gln Gly Arg Gly Ala Glu Ala
145 150 155 160

Arg Ala Pro Ala Glu Val Trp Lys Gly Ile Lys Lys Arg Gln Arg Asp
165 170 175



(2) INFORMATION FOR SEQ ID NO: 196: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:

Met Ser Asn Ala Cys Lys Glu Leu Ala Ile Phe Leu Thr Thr Gly Ile
1 5 10 15

Val Val Ser Ala Phe Gly Leu Pro Ile Val Phe Ala Arg Ala His Leu
20 25 30

Ile Glu Trp Gly Ala Cys Ala Leu Val Leu Thr Gly Asn Thr Val Ile
35 40 45

Phe Ala Thr Ile Leu Gly Phe Phe Leu Val Phe Gly Ser Asn Asp Asp
50 55 60

Phe Ser Trp Gln Gln Trp
65 70



(2) INFORMATION FOR SEQ ID NO: 197: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:

Met Thr Leu Leu Ile Ile Phe Leu Pro Phe Xaa Phe Thr Thr Xaa Thr
1 5 10 15

Asn Ser Gly Gly Ser Phe Pro Val Arg
20 25



(2) INFORMATION FOR SEQ ID NO: 198:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 73 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:

Met Lys Gly Glu Leu Leu Pro Phe Leu Phe Leu Thr Val Trp Leu Trp
1 5 10 15



WO 98/42738 



PCT/US98/05311 






Leu Tyr Lys Leu Xaa Phe Gly Glu Ser Pro Arg Tyr Pro Asn Val Ile
20 25 30

Gly Lys Thr Tyr Phe Phe Phe Trp Thr Asp Gln Ile Ser Arg Glu Ser
35 40 45

Arg Phe Leu Glu Arg Leu Ala Phe Ile Val Ser Glu Asn Cys Leu Ile
50 55 60

Phe Leu Ile His Ala Ile Thr Gly Gln
65 70



(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 289 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:

Met Ser Gly Phe Ser Thr Glu Glu Arg Ala Ala Pro Phe Ser Leu Glu
1 5 10 15

Tyr Arg Val Phe Leu Lys Asn Glu Lys Gly Gln Tyr Ile Ser Pro Phe
20 25 30

His Asp Ile Pro Ile Tyr Ala Asp Lys Asp Val Phe His Met Val Val
35 40 45

Glu Val Pro Arg Trp Ser Asn Ala Lys Met Glu Ile Ala Thr Lys Asp
50 55 60

Pro Leu Asn Pro Ile Lys Gln Asp Val Lys Lys Gly Lys Leu Arg Tyr
65 70 75 80

Val Ala Asn Leu Phe Pro Tyr Lys Gly Tyr Ile Trp Asn Tyr Gly Ala
85 90 95

Ile Pro Gln Thr Trp Glu Asp Pro Gly His Asn Asp Lys His Thr Gly
100 105 110

Cys Cys Gly Asp Asn Asp Pro Ile Asp Val Cys Glu Ile Gly Ser Lys
115 120 125

Val Cys Ala Arg Gly Glu Ile Ile Gly Val Lys Val Leu Gly Ile Leu
130 135 140

Ala Met Ile Asp Glu Gly Glu Thr Asp Trp Lys Val Ile Ala Ile Asn
145 150 155 160

Val Asp Asp Pro Asp Ala Ala Asn Tyr Asn Asp Ile Asn Asp Val Lys
165 170 175

Arg Leu Lys Pro Gly Tyr Leu Glu Ala Thr Val Asp Trp Phe Arg Arg
180 185 190

Tyr Lys Val Pro Asp Gly Lys Pro Glu Asn Glu Phe Ala Phe Asn Ala
195 200 205

Glu Phe Lys Asp Lys Asp Phe Ala Ile Asp Ile Ile Lys Ser Thr His
210 215 220

Asp His Trp Lys Ala Leu Val Thr Lys Lys Thr Asn Gly Lys Gly Ile
225 230 235 240

Ser Cys Met Asn Thr Thr Leu Ser Glu Ser Pro Phe Lys Cys Asp Pro
245 250 255

Asp Ala Ala Arg Ala Ile Val Asp Ala Leu Pro Pro Pro Cys Glu Ser
260 265 270

Ala Cys Thr Val Pro Thr Asp Val Asp Lys Trp Phe His His Gln Lys
275 280 285

Asn

(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 625 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:

Met Glu Ile Pro Gly Ser Leu Cys Lys Lys Val Lys Leu Ser Asn Asn
1 5 10 15

Ala Gln Asn Trp Gly Met Gln Arg Ala Thr Asn Val Thr Tyr Gln Ala
20 25 30

His His Val Ser Arg Asn Lys Arg Gly Gln Val Val Gly Thr Arg Gly
35 40 45

Gly Phe Arg Gly Cys Thr Val Trp Leu Thr Gly Leu Ser Gly Ala Gly
50 55 60

Lys Thr Thr Val Ser Met Ala Leu Glu Glu Tyr Leu Val Cys His Gly
65 70 75 80

Ile Pro Cys Tyr Thr Leu Asp Gly Asp Asn Ile Arg Gln Gly Leu Asn
85 90 95

Lys Asn Leu Gly Phe Ser Pro Glu Asp Arg Glu Glu Asn Val Arg Arg
100 105 110

Ile Ala Glu Val Ala Lys Leu Phe Ala Asp Ala Gly Leu Val Cys Ile
115 120 125

Thr Ser Phe Ile Ser Pro Tyr Thr Gln Asp Arg Asn Asn Ala Arg Gln
130 135 140

Ile His Glu Gly Ala Ser Leu Pro Phe Phe Glu Val Phe Val Asp Ala
145 150 155 160



Pro Leu His Val Cys Glu Gln Arg Asp Val Lys Gly Leu Tyr Lys Lys
165 170 175

Ala Arg Ala Gly Glu Ile Lys Gly Phe Thr Gly Ile Asp Ser Glu Tyr
180 185 190

Glu Lys Pro Glu Ala Pro Glu Leu Val Leu Lys Thr Asp Ser Cys Asp
195 200 205

Val Asn Asp Cys Val Gln Gln Val Val Glu Leu Leu Gln Glu Arg Asp
210 215 220

Ile Val Pro Val Asp Ala Ser Tyr Glu Val Lys Glu Leu Tyr Val Pro
225 230 235 240

Glu Asn Lys Leu His Leu Ala Lys Thr Asp Ala Glu Thr Leu Pro Ala
245 250 255



Leu Lys Ile Asn Lys Val Asp Met Gln Trp Val Gln Val Leu Ala Glu
260 265 270

Gly Trp Ala Thr Pro Leu Asn Gly Phe Met Arg Glu Arg Glu Tyr Leu
275 280 285

Gln Cys Leu His Phe Asp Cys Leu Leu Asp Gly Gly Val Ile Asn Leu
290 295 300

Ser Val Pro Ile Val Leu Thr Ala Thr His Glu Asp Lys Glu Arg Leu
305 310 315 320

Asp Gly Cys Thr Ala Phe Ala Leu Met Tyr Glu Gly Arg Arg Val Ala
325 330 335

Ile Leu Arg Asn Pro Glu Phe Phe Glu His Arg Lys Glu Glu Arg Cys
340 345 350

Ala Arg Gln Trp Gly Thr Thr Cys Lys Asn His Pro Tyr Ile Lys Met
355 360 365

Val Met Glu Gln Gly Asp Trp Leu Ile Gly Gly Asp Leu Gln Val Leu
370 375 380

Asp Arg Val Tyr Trp Asn Asp Gly Leu Asp Gln Tyr Arg Leu Thr Pro
385 390 395 400

Thr Glu Leu Lys Gln Lys Phe Lys Asp Met Asn Ala Asp Ala Val Phe
405 410 415



Ala Phe Gln Leu Arg Asn Pro Val His Asn Gly His Ala Leu Leu Met
420 425 430

Gln Asp Thr His Lys Gln Leu Leu Glu Arg Gly Tyr Arg Arg Pro Val
435 440 445

Leu Leu Leu His Pro Leu Gly Gly Trp Thr Lys Asp Asp Asp Val Pro
450 455 460

Leu Met Trp Arg Met Lys Gln His Ala Ala Val Leu Glu Glu Gly Val
465 470 475 480

Leu Asn Pro Glu Thr Thr Val Val Ala Ile Phe Pro Ser Pro Met Met
485 490 495

Tyr Ala Gly Pro Thr Glu Val Gln Trp His Cys Arg Ala Arg Met Val
500 505 510

Ala Gly Ala Asn Phe Tyr Ile Val Gly Arg Asp Pro Ala Gly Met Pro
515 520 525

His Pro Glu Thr Gly Lys Asp Leu Tyr Glu Pro Ser His Gly Ala Lys
530 535 540

Val Leu Thr Met Ala Pro Gly Leu Ile Thr Leu Glu Ile Val Pro Phe
545 550 555 560

Arg Val Ala Ala Tyr Asn Lys Lys Lys Lys Arg Met Asp Tyr Tyr Asp
565 570 575

Ser Glu His His Glu Asp Phe Glu Phe Ile Ser Gly Thr Arg Met Arg
580 585 590

Lys Leu Ala Arg Glu Gly Gln Lys Pro Pro Glu Gly Phe Met Ala Pro
595 600 605

Lys Ala Trp Thr Val Leu Thr Glu Tyr Tyr Lys Ser Leu Glu Lys Ala
610 615 620

Xaa
625

(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 649 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201:

Met Ser Ala Ser Gln Asp Leu Glu Pro Lys Pro Leu Phe Pro Lys Pro
1 5 10 15

Ala Phe Gly Gln Lys Pro Pro Leu Ser Thr Glu Asn Ser His Glu Asp
20 25 30

Glu Ser Pro Met Lys Asn Val Ser Ser Ser Lys Gly Ser Pro Ala Pro
35 40 45

Leu Gly Val Arg Ser Lys Ser Gly Pro Leu Lys Pro Ala Arg Glu Asp
50 55 60

Ser Glu Asn Lys Asp His Ala Gly Glu Ile Ser Ser Leu Pro Phe Pro
65 70 75 80

Gly Val Val Leu Lys Pro Ala Ala Ser Arg Gly Gly Pro Gly Leu Ser
85 90 95

Lys Asn Gly Glu Glu Lys Lys Glu Asp Arg Lys Ile Asp Ala Ala Lys
100 105 110

Asn Thr Phe Gln Ser Lys Ile Asn Gln Glu Glu Leu Ala Ser Gly Thr
115 120 125

Pro Pro Ala Arg Phe Pro Lys Ala Pro Ser Lys Leu Thr Val Gly Gly
130 135 140



Pro Trp Gly Gln Ser Gln Glu Lys Glu Lys Gly Asp Lys Asn Ser Ala
145 150 155 160

Thr Pro Lys Gln Lys Pro Leu Pro Pro Leu Phe Thr Leu Gly Pro Pro
165 170 175

Pro Pro Lys Pro Asn Arg Pro Pro Asn Val Asp Leu Thr Lys Phe His
180 185 190

Lys Thr Ser Ser Gly Asn Ser Thr Ser Lys Gly Gln Thr Ser Tyr Ser
195 200 205

Thr Thr Ser Leu Pro Pro Pro Pro Pro Ser His Pro Ala Ser Gln Pro
210 215 220

Pro Leu Pro Ala Ser His Pro Ser Gln Pro Pro Val Pro Ser Leu Pro
225 230 235 240

Pro Arg Asn Ile Lys Pro Pro Phe Asp Leu Lys Ser Pro Val Asn Glu
245 250 255

Asp Asn Gln Asp Gly Val Thr His Ser Asp Gly Ala Gly Asn Leu Asp
260 265 270

Glu Glu Gln Asp Ser Glu Gly Glu Thr Tyr Glu Asp Ile Glu Ala Ser
275 280 285



Lys Glu Arg Glu Lys Lys Arg Glu Lys Glu Glu Lys Lys Arg Leu Glu
290 295 300

Leu Glu Lys Lys Glu Gln Lys Glu Lys Glu Lys Lys Glu Gln Glu Ile
305 310 315 320

Lys Lys Lys Phe Lys Leu Thr Gly Pro Ile Gln Val Ile His Leu Ala
325 330 335

Lys Ala Cys Cys Asp Val Lys Gly Gly Lys Asn Glu Leu Ser Phe Lys
340 345 350

Gln Gly Glu Gln Ile Glu Ile Ile Arg Ile Thr Asp Asn Pro Glu Gly
355 360 365

Lys Trp Leu Gly Arg Thr Ala Arg Gly Ser Tyr Gly Tyr Ile Lys Thr
370 375 380

Thr Ala Val Glu Ile Asp Tyr Asp Ser Leu Lys Leu Lys Lys Asp Ser
385 390 395 400

Leu Gly Ala Pro Ser Arg Pro Ile Glu Asp Asp Gln Glu Val Tyr Asp
405 410 415

Asp Val Ala Glu Gln Asp Asp Ile Ser Ser His Ser Gln Ser Gly Ser
420 425 430

Gly Gly Ile Phe Pro Pro Pro Pro Asp Asp Asp Ile Tyr Asp Gly Ile
435 440 445

Glu Glu Glu Asp Ala Asp Asp Gly Ser Thr Leu Gln Val Gln Glu Lys
450 455 460

Ser Asn Thr Trp Ser Trp Gly Ile Leu Lys Met Leu Lys Gly Lys Asp
465 470 475 480

Asp Arg Lys Lys Ser Ile Arg Glu Lys Pro Lys Val Ser Asp Ser Asp
485 490 495



Asn Asn Glu Gly Ser Ser Phe Pro Ala Pro Pro Lys Gln Leu Asp Met
500 505 510

Gly Asp Glu Val Tyr Asp Asp Val Asp Thr Ser Asp Phe Pro Val Ser
515 520 525

Ser Ala Glu Met Ser Gln Gly Thr Asn Val Gly Lys Ala Lys Thr Glu
530 535 540

Glu Lys Asp Leu Lys Lys Leu Lys Lys Gln Xaa Lys Xaa Xaa Lys Asp
545 550 555 560

Phe Arg Lys Lys Phe Lys Tyr Asp Gly Glu Ile Arg Val Leu Tyr Ser
565 570 575

Thr Lys Val Thr Thr Ser Ile Thr Ser Lys Lys Trp Gly Thr Arg Asp
580 585 590

Leu Gln Val Lys Pro Gly Glu Ser Leu Glu Val Ile Gln Thr Thr Asp
595 600 605

Asp Thr Lys Val Leu Cys Arg Asn Glu Glu Gly Lys Tyr Gly Tyr Val
610 615 620

Leu Arg Ser Tyr Leu Ala Asp Asn Asp Gly Glu Ile Tyr Asp Asp Ile
625 630 635 640

Ala Asp Gly Cys Ile Tyr Asp Asn Asp
645



(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 55 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:

Met Ala Trp Pro Ser Arg Ser Lys Met Phe Thr Leu Leu Pro Val Leu
1 5 10 15

Cys Tyr Leu Trp Ser Leu Trp Leu Pro Gln Phe Ser Trp Ile Gln Glu
20 25 30

Leu Lys Ala Val Leu Arg Asp Asp Gly Leu Ile Ser Ala Val Ala Trp
35 40 45

Asn Ala Glu Phe Gln Thr Cys
50 55



(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 267 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:

Met Val Lys Val Thr Phe Asn Ser Ala Leu Ala Gln Lys Glu Ala Lys
1 5 10 15

Lys Asp Glu Pro Lys Ser Gly Glu Glu Ala Leu Ile Ile Pro Pro Asp
20 25 30

Ala Val Ala Val Asp Cys Lys Asp Pro Asp Asp Val Val Pro Val Gly
35 40 45



Gln Arg Arg Ala Trp Cys Trp Cys Met Cys Phe Gly Leu Ala Phe Met
50 55 60

Leu Ala Gly Val Ile Leu Gly Gly Ala Tyr Leu Tyr Lys Tyr Phe Ala
65 70 75 80

Leu Gln Pro Asp Asp Val Tyr Tyr Cys Gly Ile Lys Tyr Ile Lys Asp
85 90 95

Asp Val Ile Leu Asn Glu Pro Ser Ala Asp Ala Pro Ala Ala Leu Tyr
100 105 110

Gln Thr Ile Glu Glu Asn Ile Lys Ile Phe Glu Glu Glu Glu Val Glu
115 120 125

Phe Ile Ser Val Pro Val Pro Glu Phe Ala Asp Ser Asp Pro Ala Asn
130 135 140

Ile Val His Asp Phe Asn Lys Lys Leu Thr Ala Tyr Leu Asp Leu Asn
145 150 155 160

Leu Asp Lys Cys Tyr Val Ile Pro Leu Asn Thr Ser Ile Val Met Pro
165 170 175

Pro Arg Asn Leu Leu Glu Leu Leu Ile Asn Ile Lys Ala Gly Thr Tyr
180 185 190

Leu Pro Gln Ser Tyr Leu Ile His Glu His Met Val Ile Thr Asp Arg
195 200 205

Ile Glu Asn Ile Asp His Leu Gly Phe Phe Ile Tyr Arg Leu Cys His
210 215 220

Asp Lys Glu Thr Tyr Lys Leu Gln Arg Arg Glu Thr Ile Lys Gly Ile
225 230 235 240

Gln Lys Arg Glu Ala Ser Asn Cys Phe Ala Ile Arg His Phe Glu Asn
245 250 255

Lys Phe Ala Val Glu Thr Leu Ile Cys Ser Xaa
260 265



(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 315 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204:

Met Asp Leu Arg Gln Phe Leu Met Cys Leu Ser Leu Cys Thr Ala Phe
1 5 10 15

Ala Leu Ser Lys Pro Thr Glu Lys Lys Asp Arg Val His His Glu Pro
20 25 30

Gln Leu Ser Asp Lys Val His Asn Asp Ala Gln Ser Phe Asp Tyr Asp
35 40 45

His Asp Ala Phe Leu Gly Ala Glu Glu Ala Lys Thr Phe Asp Gln Leu
50 55 60

Thr Pro Glu Glu Ser Lys Glu Arg Leu Gly Lys Ile Val Ser Lys Ile
65 70 75 80

Asp Gly Asp Lys Asp Gly Phe Val Thr Val Asp Glu Leu Lys Asp Trp
85 90 95

Ile Lys Phe Ala Gln Lys Arg Trp Ile Tyr Glu Asp Val Glu Arg Gln
100 105 110

Trp Lys Gly His Asp Leu Asn Glu Asp Gly Leu Val Ser Trp Glu Glu
115 120 125

Tyr Lys Asn Ala Thr Tyr Gly Tyr Val Leu Asp Asp Pro Asp Pro Asp
130 135 140

Asp Gly Phe Asn Tyr Lys Gln Met Met Val Arg Asp Glu Arg Arg Phe
145 150 155 160

Lys Met Ala Asp Lys Asp Gly Asp Leu Ile Ala Thr Lys Glu Glu Phe
165 170 175

Thr Ala Phe Leu His Pro Glu Glu Tyr Asp Tyr Met Lys Asp Ile Val
180 185 190

Val Gln Glu Thr Met Glu Asp Ile Asp Lys Asn Ala Asp Gly Phe Ile
195 200 205

Asp Leu Glu Glu Tyr Ile Gly Asp Met Tyr Ser His Asp Gly Asn Thr
210 215 220

Asp Glu Pro Glu Trp Val Lys Thr Glu Arg Glu Gln Phe Val Glu Phe
225 230 235 240

Arg Asp Lys Asn Arg Asp Gly Lys Met Asp Lys Glu Glu Thr Lys Asp
245 250 255

Trp Ile Leu Pro Ser Asp Tyr Asp His Ala Glu Ala Glu Ala Arg His
260 265 270

Leu Val Tyr Glu Ser Asp Gln Asn Lys Asp Gly Lys Leu Thr Lys Glu
275 280 285

Glu Ile Val Asp Lys Tyr Asp Leu Phe Val Gly Ser Gln Ala Thr Asp
290 295 300

Phe Gly Glu Ala Leu Val Arg His Asp Glu Phe
305 310 315

(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 207 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

Met Phe Asp Ala Val Leu Ile Leu Leu Leu Ile Pro Leu Lys Asp Lys
1 5 10 15

Leu Val Asp Pro Ile Leu Arg Arg His Gly Leu Leu Pro Ser Ser Leu
20 25 30

Lys Arg Ile Ala Val Gly Met Phe Phe Val Met Cys Ser Ala Phe Ala
35 40 45

Ala Gly Ile Leu Glu Ser Lys Arg Leu Asn Leu Val Lys Glu Lys Thr
50 55 60

Ile Asn Gln Thr Ile Gly Asn Val Val Tyr His Ala Ala Asp Leu Ser
65 70 75 80

Leu Trp Trp Gln Val Pro Gln Tyr Leu Leu Ile Gly Ile Ser Glu Ile
85 90 95

Phe Ala Ser Ile Ala Gly Leu Glu Phe Ala Tyr Ser Ala Ala Pro Lys
100 105 110

Ser Met Gln Ser Ala Ile Met Gly Leu Phe Phe Phe Phe Ser Gly Val
115 120 125

Gly Ser Phe Val Gly Ser Gly Leu Leu Ala Leu Val Ser Ile Lys Ala
130 135 140

Ile Gly Trp Met Ser Ser His Thr Asp Phe Gly Asn Ile Asn Gly Cys
145 150 155 160

Tyr Leu Asn Tyr Tyr Phe Phe Leu Leu Ala Ala Ile Gln Gly Ala Thr
165 170 175

Leu Leu Leu Phe Leu Ile Ile Ser Val Lys Tyr Asp His His Arg Asp
180 185 190

His Gln Arg Ser Arg Ala Asn Gly Val Pro Thr Ser Arg Arg Ala
195 200 205



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 196 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206:

Met Arg Ser Arg Ile Arg Glu Phe Asp Ser Ser Thr Leu Asn Glu Ser
1 5 10 15

Val Arg Asn Thr Ile Met Arg Asp Leu Lys Ala Val Gly Lys Lys Phe
20 25 30

Met His Val Leu Tyr Pro Arg Lys Ser Asn Thr Leu Leu Arg Asp Trp
35 40 45

Asp Leu Trp Gly Pro Leu Ile Leu Cys Val Thr Leu Ala Leu Met Leu
50 55 60

Gln Arg Asp Ser Ala Asp Ser Glu Lys Asp Gly Gly Pro Gln Phe Ala
65 70 75 80

Glu Val Phe Val Ile Val Trp Phe Gly Ala Val Thr Ile Thr Leu Asn
85 90 95

Ser Lys Leu Leu Gly Gly Asn Ile Ser Phe Phe Gln Ser Leu Cys Val
100 105 110

Leu Gly Tyr Cys Ile Leu Pro Leu Thr Val Ala Met Leu Ile Cys Arg
115 120 125

Leu Val Leu Leu Ala Asp Pro Gly Pro Val Asn Phe Met Val Arg Leu
130 135 140

Phe Val Val Ile Val Met Phe Ala Trp Ser Ile Val Ala Ser Thr Ala
145 150 155 160

Phe Leu Ala Asp Ser Gln Pro Pro Asn Arg Arg Ala Leu Ala Val Tyr
165 170 175

Pro Val Phe Leu Phe Tyr Phe Val Ile Ser Trp Met Ile Leu Thr Phe
180 185 190

Thr Pro Gln Xaa
195

(2) INFORMATION FOR SEQ ID NO: 207: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:

Met Ala Lys Asp Gln Ala Val Glu Asn Ile Leu Val Ser Pro Val Val
1 5 10 15

Val Ala Ser Ser Leu Gly Leu Val Ser Leu Gly Gly Lys Ala Thr Thr
20 25 30

Ala Ser Gln Ala Lys Ala Val Leu Ser Ala Glu Gln Leu Arg Asp Glu
35 40 45

Glu Val His Ala Gly Leu Gly Glu Leu Leu Arg Ser Leu Ser Asn Ser
50 55 60

Thr Ala Arg Asn Val Thr Trp Lys Leu Gly Ser Arg Leu Tyr Gly Pro
65 70 75 80

Ser Ser Val Ser Phe Ala Asp Asp Phe Val Arg Ser Ser Lys Gln His
85 90 95

Tyr Asn Cys Glu His Ser Lys Ile Asn Phe Arg Asp Lys Arg Ser Ala
100 105 110

Leu Gln Ser Ile Asn Glu Trp Ala Ala Gln Thr Thr Asp Gly Lys Leu
115 120 125



Pro Glu Val Thr Lys Asp Val Glu Arg Thr Asp Gly Ala Leu Leu Val
130 135 140

Asn Ala Met Phe Phe Lys Pro His Trp Asp Glu Lys Phe His His Lys
145 150 155 160

Met Val Asp Asn Arg Gly Phe Met Val Thr Arg Ser Tyr Thr Val Gly
165 170 175

Val Met Met Met His Arg Thr Gly Leu Tyr Asn Tyr Tyr Asp Asp Glu
180 185 190

Lys Glu Lys Leu Gln Ile Val Glu Met Pro Leu Ala His Lys Leu Ser
195 200 205

Ser Leu Ile Ile Leu Met Pro His His Val Glu Pro Leu Glu Arg Leu
210 215 220

Glu Lys Leu Leu Thr Lys Glu Gln Leu Lys Ile Trp Met Gly Lys Met
225 230 235 240

Gln Lys Lys Ala Val Ala Ile Ser Leu Pro Lys Gly Val Val Glu Val
245 250 255

Thr His Asp Leu Gln Lys His Leu Ala Gly Leu Gly Leu Thr Glu Ala
260 265 270

Ile Asp Lys Asn Lys Ala Asp Leu Ser Arg Met Ser Gly Lys Lys Asp
275 280 285

Leu Tyr Leu Ala Ser Val Phe His Ala Thr Ala Phe Glu Leu Asp Thr
290 295 300

Asp Gly Asn Pro Leu Thr Arg Ile Thr Gly Gly Gly Val Arg Thr Gln
305 310 315 320

Val Phe Tyr Ala Asp His Pro Phe Ile Ser Xaa
325 330

(2) INFORMATION FOR SEQ ID NO: 208: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:

Met Cys Met Gln Leu Phe Gly Phe Leu Ala Phe Met Ile Phe Met Cys
1 5 10 15

Trp Val Gly Asp Val Tyr Pro Val Tyr Gln Pro Val Gly Pro Lys Gln
20 25 30

Tyr Pro Tyr Asn Asn Leu Tyr Leu Glu Arg Gly Gly Asp Pro Ser Lys
35 40 45

Glu Pro Glu Arg Val Val His Tyr Glu Ile
50 55

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:

Met Asp Ala Leu Val Glu Asp Asp Ile Cys Ile Leu Asn His Glu Lys
1 5 10 15

Ala His Lys Arg Asp Thr Val Thr Pro Val Ser Ile Tyr Ser Gly Asp
20 25 30

Glu Ser Val Ala Ser His Phe Ala Leu Val Thr Ala Tyr Glu Asp Ile
35 40 45

Lys Lys Arg Leu Lys Asp Ser Glu Lys Glu Asn Ser Leu Leu Lys Lys
50 55 60

Arg Ile Arg Phe Leu Glu Glu Lys Leu Ile Ala Arg Phe Glu Glu Glu
65 70 75 80

Thr Ser Ser Val Gly Arg Glu Gln Val Asn Lys Ala Tyr His Ala Tyr
85 90 95

Arg Glu Val Cys Ile Asp Arg Asp Asn Leu Lys Ser Lys Leu Asp Lys
100 105 110



Met Asn Lys Asp Asn Ser Glu Ser Leu Lys Val Leu Asn Glu Gln Leu
115 120 125

Gln Ser Lys Glu Val Glu Leu Leu Gln Leu Arg Thr Glu Val Glu Thr
130 135 140

Gln Gln Val Met Arg Asn Leu Asn Pro Pro Ser Ser Asn Trp Glu Val
145 150 155 160

Glu Lys Leu Ser Cys Asp Leu Lys Ile His Gly Leu Glu Gln Glu Leu
165 170 175

Glu Leu Met Arg Lys Glu Cys Ser Asp Leu Lys Ile Glu Leu Gln Lys
180 185 190

Ala Lys Gln Thr Asp Pro Tyr Gln Glu Asp Asn Leu Lys Ser Arg Asp
195 200 205



Leu Gln Lys Leu Ser Ile Ser Ser Asp Asn Met Gln His Ala Tyr Trp
210 215 220

Glu Leu Lys Arg Glu Met Ser Asn Leu His Leu Val Thr Gln Val Gln
225 230 235 240

Ala Glu Leu Leu Arg Lys Leu Lys Thr Ser Thr Ala Ile Lys Lys Ala
245 250 255

Cys Ala Pro Val Gly Cys Ser Glu Asp Leu Gly Arg Asp Ser Thr Lys
260 265 270

Leu His Leu Met Asn Phe Thr Ala Thr Tyr Thr Arg His Pro Pro Leu
275 280 285

Leu Pro Asn Gly Lys Ala Leu Cys His Thr Thr Ser Ser Pro Leu Pro
290 295 300

Gly Asp Val Lys Val Leu Ser Glu Lys Ala Ile Leu Gln Ser Trp Thr
305 310 315 320

Asp Asn Glu Arg Ser Ile Pro Asn Asp Gly Thr Cys Phe Gln Glu His
325 330 335

Ser Ser Tyr Gly Arg Asn Ser Leu Glu Asp Asn Ser Trp Val Phe Pro
340 345 350

Ser Pro Pro Lys Ser Ser Glu Thr Ala Phe Gly Glu Thr Lys Thr Lys
355 360 365

Thr Leu Pro Leu Pro Asn Leu Pro Pro Leu His Tyr Leu Asp Gln His
370 375 380

Asn Gln Asn Cys Leu Tyr Lys Asn
385 390

(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:

Met His His His Thr Gln Leu Met Phe Ile Tyr Leu Phe Ile Tyr Leu
1 5 10 15

Phe Ile Leu Gly Val Phe Phe Phe Phe Phe Xaa
20 25



(2) INFORMATION FOR SEQ ID NO: 211: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:

Met Asn Cys Ile Leu Leu Leu Tyr Leu Leu Ile Pro Thr Ile Ser Ile
1 5 10 15

Ser Val Val Pro Tyr Val Ala Leu Asn Ile Lys Tyr Ile Lys Glu Cys
20 25 30

Thr Glu Asn Ser Phe Tyr Xaa
35

(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:

Met Leu Leu His Leu Thr Ala Ala Phe Leu Gln Arg Ala Gln Phe Ser
1 5 10 15

Thr Tyr Phe Pro Gly Tyr Phe Asp Gly Gln Tyr Trp Leu Trp Trp Val
20 25 30

Phe Leu Val Leu Gly Phe Leu Leu Phe Leu Arg Gly Phe Ile Asn Tyr
35 40 45

Ala Lys Val Arg Lys Met Pro Glu Thr Phe Ser Asn Leu Pro Arg Thr
50 55 60

Arg Val Leu Phe Ile Tyr Xaa
65 70

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213:

Met Leu Thr Phe Phe Met Ala Phe Leu Phe Asn Trp Ile Gly Phe Phe
1 5 10 15

Leu Ser Phe Cys Leu Thr Thr Ser Ala Ala Gly Arg Tyr Gly Ala Ile
20 25 30

Ser Gly Phe Gly Leu Ser Leu Ile Lys Trp Ile Leu Ile Val Arg Phe
35 40 45

Ser Thr Tyr Phe Pro Ala Phe Met Asn Ser Leu Ser Arg Ser Lys Arg
50 55 60

Thr Pro Ala Gly Ser Glu Ser Arg Cys Arg Thr Gln Arg Asn Asn His
65 70 75 80

Leu Leu Xaa

(2) INFORMATION FOR SEQ ID NO: 214: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214:

Met Ser Lys Arg Ser Ala Ser Phe Ile Leu Leu Pro Leu Leu Phe Leu
1 5 10 15

Lys Gly Ser Phe Ala Lys Leu Asn Ala Arg Ile Ser Asp Cys Leu Glu
20 25 30

Glu Arg Tyr Cys His Asn Leu Trp Met Val Phe Gln Gly Cys Val Ile
35 40 45

Thr Glu Leu His Leu Ser Arg Met Ser Lys Thr Leu Ser Ser Leu Cys
50 55 60

Tyr Asp Phe Val Ile Asn Val Tyr Ile Phe Phe Lys Phe Leu Asp Ile
65 70 75 80

Thr



(2) INFORMATION FOR SEQ ID NO: 215: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:

Met Cys Ser Leu Phe Glu Ser Arg Phe Phe Cys Phe Val Leu Phe Ser
1 5 10 15

Glu Lys Ile Ile Gln Leu Cys Ala Ser Ile Ala Phe Leu Cys Phe Val
20 25 30

Lys His Val Pro Trp Pro Lys Trp Lys Arg Lys Cys Leu Ile Asn Ala
35 40 45

Phe



(2) INFORMATION FOR SEQ ID NO: 216: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 203 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216:

Met Thr Leu Arg Pro Ser Leu Leu Pro Leu His Leu Leu Leu Leu Leu
1 5 10 15

Leu Leu Ser Ala Ala Val Cys Arg Ala Glu Ala Gly Leu Glu Thr Glu
20 25 30

Ser Pro Val Arg Thr Leu Gln Val Glu Thr Leu Val Glu Pro Pro Glu
35 40 45

Pro Cys Ala Glu Pro Ala Ala Phe Gly Asp Thr Leu His Ile His Tyr
50 55 60

Thr Gly Ser Leu Val Asp Gly Arg Ile Ile Asp Thr Ser Leu Thr Arg
65 70 75 80

Asp Pro Leu Val Ile Glu Leu Gly Gln Lys Gln Val Ile Pro Gly Leu
85 90 95

Glu Gln Ser Leu Leu Asp Met Cys Val Gly Glu Lys Arg Arg Ala Ile
100 105 110

Ile Pro Ser His Leu Ala Tyr Gly Lys Arg Gly Phe Pro Pro Ser Val
115 120 125

Pro Ala Asp Ala Val Val Gln Tyr Asp Val Glu Leu Ile Ala Leu Ile
130 135 140

Arg Ala Asn Tyr Trp Leu Lys Leu Val Lys Gly Ile Leu Pro Leu Val
145 150 155 160

Gly Met Ala Met Val Pro Pro Ser Trp Ala Ser Leu Gly Ile Thr Tyr
165 170 175

Thr Glu Arg Pro Ile Asp Pro Lys Ser Pro Lys Arg Ser Ser Arg Lys
180 185 190

Arg Asn Glu Thr Arg Ala Lys Arg Asn Asn Lys
195 200

(2) INFORMATION FOR SEQ ID NO: 217:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 186 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:

Met Lys Thr Leu Met Thr Ile Cys Pro Gly Thr Val Leu Leu Val Phe
1 5 10 15

Ser Ile Ser Leu Trp Ile Ile Ala Ala Trp Thr Val Arg Val Cys Glu
20 25 30

Ser Pro Glu Ser Pro Ala Gln Pro Ser Gly Ser Ser Leu Pro Ala Trp
35 40 45

Tyr His Asp Gln Gln Asp Val Thr Ser Asn Phe Leu Gly Ala Met Trp
50 55 60

Leu Ile Ser Ile Thr Phe Leu Ser Ile Gly Tyr Gly Asp Met Val Pro
65 70 75 80

His Thr Tyr Cys Gly Lys Gly Val Cys Leu Leu Thr Gly Ile Met Gly
85 90 95

Ala Gly Cys Thr Ala Leu Val Val Ala Val Val Ala Arg Lys Leu Glu
100 105 110

Leu Thr Lys Ala Glu Lys His Val His Xaa Phe Met Met Asp Thr Gln
115 120 125

Leu Thr Lys Arg Ile Lys Asn Xaa Ala Ala Asn Val Leu Xaa Glu Thr
130 135 140

Trp Leu Ile Tyr Lys His Thr Lys Leu Leu Lys Lys Ile Asp His Ala
145 150 155 160

Lys Val Arg Asn Thr Arg Gly Ser Ser Ser Lys Tyr Pro Pro Val Glu
165 170 175

Glu Arg Gln Asp Gly Thr Glu Glu Ala Glu
180 185



(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 90 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:

Met Lys Phe Leu Ala Val Leu Val Leu Leu Gly Val Ser Ile Phe Leu
1 5 10 15

Val Ser Ala Gln Asn Pro Thr Thr Ala Ala Pro Ala Asp Thr Tyr Pro
20 25 30

Ala Thr Gly Pro Ala Asp Asp Glu Ala Pro Asp Ala Glu Thr Thr Ala
35 40 45

Ala Ala Thr Thr Ala Thr Thr Ala Ala Pro Thr Thr Ala Thr Thr Ala
50 55 60

Ala Ser Thr Thr Ala Arg Lys Asp Ile Pro Val Leu Pro Lys Trp Val
65 70 75 80

Gly Asp Leu Pro Asn Gly Arg Val Cys Pro
85 90



(2) INFORMATION FOR SEQ ID NO: 219: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 139 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

Met Ser Ser Ala Ala Ala Asp His Trp Ala Trp Leu Leu Val Leu Ser
1 5 10 15

Phe Val Phe Gly Cys Asn Val Leu Arg Ile Leu Leu Pro Ser Phe Ser
20 25 30

Ser Phe Met Ser Arg Val Leu Gln Lys Asp Ala Glu Gln Glu Ser Gln
35 40 45

Met Arg Ala Glu Ile Gln Asp Met Lys Gln Glu Leu Ser Thr Val Asn
50 55 60

Met Met Asp Glu Phe Ala Arg Tyr Ala Arg Leu Glu Arg Lys Ile Asn
65 70 75 80

Lys Met Thr Asp Lys Leu Lys Thr His Val Lys Ala Arg Thr Ala Gln
85 90 95

Leu Ala Lys Ile Lys Trp Val Ile Ser Val Ala Phe Tyr Val Leu Gln
100 105 110

Ala Ala Leu Met Ile Ser Leu Ile Trp Lys Tyr Tyr Ser Val Pro Val
115 120 125

Ala Val Val Pro Ser Lys Trp Ile Thr Leu Xaa
130 135



(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

Met Ser Ser Ala Ala Ala Asp His Trp Ala Trp Leu Leu Val Leu Ser
1 5 10 15

Phe Val Phe Gly Cys Asn Val Leu Arg Ile Leu Leu Pro Ser Phe Ser
20 25 30

Ser Phe Met Ser Arg Val Leu Gln Lys Asp Ala Asp Arg Ser His Arg
35 40 45



(2) INFORMATION FOR SEQ ID NO: 221: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221:

Met Thr Ala Pro Leu Pro Pro Leu Ser Gly Leu Ala Leu Phe Leu Ile
1 5 10 15

Val Phe Phe Ser Leu Gly Val Phe Cys Ile Cys His Ser His Trp Tyr
20 25 30

His Thr Leu Gln Gln Met Ala Gly Thr Glu Pro Lys Ala Leu Leu Leu
35 40 45

Ser Pro Pro Ala Ala Thr Thr Phe Val Thr Val Thr His Glu Val Trp
50 55 60

Lys Glu Gln Ala Leu Ala
65 70

(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 83 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:

Met Thr Cys Ser Val Ala Leu Leu Leu Ile Leu Gly Leu Arg Cys Ser
1 5 10 15

Gly Val Arg Pro Gly Leu Val Gly Glu Gly His Asn Pro Ser Leu Leu
20 25 30

Val Cys Leu Leu Leu Lys Asp Ser Arg Thr Asn Gln Gly Ser Cys Pro
35 40 45

Gly Gly Pro Trp Ser Glu Arg Asp Ile Glu Ser Val Thr Ser Asp Asn
50 55 60

Cys Glu Ala Thr Leu Gly Tyr Arg Asn His Ser Leu Pro Ser Asn Tyr
65 70 75 80

Tyr Asn Ser



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223:

Met Leu Thr Arg Ser Leu Lys Thr Leu Pro Ser Ala Cys Thr Ala Phe
1 5 10 15

Leu Leu Leu Phe Phe Leu Phe Ser Ser Gly Asp Pro Glu Leu Ser Cys
20 25 30

Ser Cys Thr Leu Arg Thr Gln Ser Ser Trp Ser
35 40



(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 184 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:

Met Trp Arg Pro Ser Val Leu Leu Leu Leu Leu Leu Leu Arg His Gly
1 5 10 15

Ala Gln Gly Lys Pro Ser Pro Asp Ala Gly Pro His Gly Gln Gly Arg
20 25 30

Val His Gln Ala Ala Pro Leu Ser Asp Ala Pro His Asp Asp Ala His
35 40 45

Gly Asn Phe Gln Tyr Asp His Glu Ala Phe Leu Gly Arg Glu Val Ala
50 55 60

Lys Glu Phe Asp Gln Leu Thr Pro Glu Glu Ser Gln Ala Arg Leu Gly
65 70 75 80

Arg Ile Val Asp Arg Met Asp Arg Ala Gly Asp Gly Asp Gly Trp Val
85 90 95

Ser Leu Ala Glu Leu Arg Ala Trp Ile Ala His Thr Gln Gln Arg His
100 105 110

Ile Arg Asp Ser Val Ser Ala Ala Trp Asp Thr Tyr Asp Thr Asp Arg
115 120 125

Asp Gly Arg Val Gly Trp Glu Glu Leu Arg Asn Xaa Thr Tyr Gly His
130 135 140

Xaa Xaa Pro Xaa Glu Glu Phe His Asp Val Glu Asp Ala Glu Thr Tyr
145 150 155 160

Lys Lys Met Leu Xaa Arg Asp Glu Arg Arg Phe Arg Val Ala Asp Gln
165 170 175

Asp Gly Asp Ser Met Ala Thr Arg
180



(2) INFORMATION FOR SEQ ID NO: 225: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225:

Met Trp Leu Phe Ile Leu Leu Ser Leu Ala Leu Ile Ser Asp Ala Met
1 5 10 15

Val Met Asp Glu Lys Val Lys Arg Ser Leu Cys Trp Thr Arg Leu Leu
20 25 30

Pro Ser Ala Thr Thr Met Pro Xaa Thr Arg Ile Thr Pro Asn Thr Gly
35 40 45

Ala Glu Xaa Ile Ser Val Xaa Thr Ala Thr Ser Ser Pro Ser Pro Leu
50 55 60

Thr Ala Pro Ile Met Trp Pro
65 70



(2) INFORMATION FOR SEQ ID NO: 226: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

Met His Val Phe Val Leu Glu Ile Phe Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 138 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:

Met Ala Val Ala Thr Leu Ala Ser Glu Thr Leu Pro Leu Leu Ala Leu
1 5 10 15

Thr Phe Ile Thr Asp Asn Ser Leu Val Ala Ala Gly His Asp Cys Phe
20 25 30

Pro Val Leu Phe Thr Tyr Asp Ala Ala Ala Gly Met Leu Ser Phe Gly
35 40 45

Gly Arg Leu Asp Val Pro Lys Gln Ser Ser Gln Arg Gly Leu Thr Ala
50 55 60

Arg Glu Arg Phe Gln Asn Leu Asp Lys Lys Ala Ser Ser Glu Gly Gly
65 70 75 80

Thr Ala Ala Gly Ala Gly Leu Asp Ser Leu His Lys Asn Ser Val Ser
85 90 95

Gln Ile Ser Val Leu Ser Gly Gly Lys Ala Lys Cys Ser Gln Phe Cys
100 105 110

Thr Thr Gly Met Asp Gly Gly Met Ser Ile Trp Asp Val Lys Ser Leu
115 120 125

Glu Ser Ala Leu Lys Asp Leu Lys Ile Lys
130 135



(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:

Leu Gly Ser Leu Ser Thr Ala Pro Ser Ser Ala Leu Pro Thr Leu Gly
1 5 10 15

Ala Arg Arg Thr Arg Ser Lys
20



(2) INFORMATION FOR SEQ ID NO: 229:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 133 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

Met Thr Tyr Phe Ser Gly Leu Leu Val Ile Leu Ala Phe Ala Ala Trp
1 5 10 15

Val Ala Leu Ala Glu Gly Leu Gly Val Ala Val Tyr Ala Ala Ala Val
20 25 30

Leu Leu Gly Ala Gly Cys Ala Thr Ile Leu Val Thr Ser Leu Ala Met
35 40 45

Thr Ala Asp Leu Ile Gly Pro His Thr Asn Ser Gly Ala Phe Val Tyr
50 55 60

Gly Ser Met Ser Phe Leu Asp Lys Val Ala Asn Gly Leu Ala Val Met
65 70 75 80

Ala Ile Gln Ser Leu His Pro Cys Pro Ser Glu Leu Cys Cys Arg Ala
85 90 95

Cys Val Ser Phe Tyr His Trp Ala Met Val Ala Val Thr Gly Gly Val
100 105 110

Gly Val Ala Ala Ala Leu Cys Leu Cys Ser Leu Leu Leu Trp Pro Thr
115 120 125

Arg Leu Arg Arg Xaa
130



(2) INFORMATION FOR SEQ ID NO: 230: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:

Gly Lys Pro Thr Gly Lys Ser Leu Pro Leu Met Trp Met Ile Leu Met
1 5 10 15

Gln Pro Ile Ile Met Ile Ser Met Met Ser Asn Gly
20 25



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 61 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231:

Met Gln Gly Lys Phe Met Lys Val Gln Val Tyr Arg Phe Leu Lys Tyr
1 5 10 15

Leu Leu Met Leu Leu Cys Met Phe Val Asn Arg Gly Met Ser Lys Asp
20 25 30

Ser Thr Lys Lys Pro Gly Gln Glu Lys Leu Lys Val Ser Leu Gly Ser
35 40 45

Ile Leu Asn Met Lys Ser Gln Arg Pro Leu Ser Trp Cys
50 55 60



(2) INFORMATION FOR SEQ ID NO: 232:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:

Met Met Glu Arg Ser Met Met Ile Leu Leu Met Ala Ala Ser Met Thr
1 5 10 15

Met Thr Ser Thr Gln Leu Trp Ser Phe Cys Cys Val His
20 25

(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233:

Met Trp Tyr Gln Leu Ala Lys Glu Glu Pro Gly Val Gly Ala Cys Ala
1 5 10 15

Leu Asp

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234:

Leu Xaa
1



(2) INFORMATION FOR SEQ ID NO: 235:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 72 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:

Met Leu Ile Cys Arg Leu Val Leu Leu Ala Asp Pro Gly Pro Val Asn
1 5 10 15

Phe Met Val Arg Leu Phe Val Val Ile Val Met Phe Ala Trp Ser Ile
20 25 30

Val Ala Ser Thr Ala Phe Leu Ala Asp Ser Gln Pro Pro Asn Arg Arg
35 40 45

Ala Leu Ala Val Tyr Pro Val Phe Leu Phe Tyr Phe Val Ile Ser Trp
50 55 60

Met Ile Leu Thr Phe Thr Pro Gln
65 70



(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236:

Met Arg Ser Leu Leu Leu Leu Ser Ala Phe Cys Leu Leu Glu Ala Ala
1 5 10 15

Leu Ala Ala Glu Val Lys Lys Pro Ala Ala Ala Ala Ala Pro Gly Thr
20 25 30

Ala Glu Lys Leu Ser Pro Lys Ala Ala Thr Leu Ala Glu Arg Xaa Pro
35 40 45

Ala Trp Pro Ser Ala Cys Thr Arg Pro Trp Pro Arg Thr Arg Gln Trp
50 55 60

Arg Thr Ser Trp Cys His Pro Trp Trp Trp Pro Arg Arg Trp Gly Ser
65 70 75 80

Cys Arg Trp Ala Ala Arg Arg Pro Arg Arg Arg Arg Pro Arg Gln Cys
85 90 95

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237:

Met Arg Ser Leu Leu Leu Leu Ser Ala Phe Cys Leu Leu Glu Ala Ala
1 5 10 15

Leu Ala Ala Glu Val Lys Lys Pro Ala Ala Ala Ala Ala Pro Gly Thr
20 25 30

Ala Glu Lys Leu Ser Pro Lys Ala Ala Thr Leu Ala Glu Arg Lys Arg
35 40 45

Pro Gly Leu Gln Leu Val Pro Gly His Gly Gln Gly Pro Gly Ser Gly
50 55 60

Glu His Pro Gly Val Thr Arg Gly Gly Gly Leu Val Ala Gly Ala Arg
65 70 75 80

Val Ala Gly Arg Gln Gly Asp His Gly Val Ala Gly Gln Gly Ser Ala
85 90 95

Glu Arg Arg Ala Ala Ala Arg Arg Gly Gly Ala Arg Arg Pro Gly Arg
100 105 110

Ala Ala Ala Leu Thr Gln Gln Leu His Gly Ala Gln Arg Asp Leu Glu
115 120 125

Ala Gly Gln Pro Thr Val Arg Thr Gln Leu Ser Glu Leu Arg Xaa
130 135 140



(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 142 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238:

Met Arg Ser Leu Leu Leu Leu Ser Ala Phe Cys Leu Leu Glu Ala Ala
1 5 10 15

Leu Ala Ala Glu Val Lys Lys Pro Ala Ala Ala Ala Ala Pro Gly Thr
20 25 30

Ala Glu Lys Leu Ser Pro Lys Ala Ala Thr Leu Ala Glu Arg Xaa Arg
35 40 45

Pro Gly Leu Gln Leu Val Pro Gly His Gly Gln Gly Pro Gly Ser Gly
50 55 60

Glu His Pro Gly Val Thr Arg Gly Gly Gly Leu Val Ala Gly Ala Arg
65 70 75 80

Val Ala Gly Arg Gln Gly Asp His Gly Val Ala Gly Gln Gly Ser Ala
85 90 95

Glu Arg Arg Ala Ala Ala Arg Arg Gly Gly Ala Arg Arg Pro Gly Arg
100 105 110

Ala Ala Ala Leu Thr Gln Gln Leu Xaa Gly Ala Gln Arg Asp Leu Glu
115 120 125

Ala Gly Gln Pro Thr Val Arg Thr Gln Leu Ser Glu Leu Arg
130 135 140



(2) INFORMATION FOR SEQ ID NO: 239:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 54 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239:

Asp Pro Glu Ala Ala Asp Ser Gly Glu Pro Gln Asn Lys Arg Thr Pro
1 5 10 15

Asp Leu Pro Glu Glu Glu Tyr Val Lys Glu Glu Ile Gln Glu Asn Glu
20 25 30

Glu Ala Val Lys Lys Met Leu Val Glu Ala Thr Arg Glu Phe Glu Glu
35 40 45

Val Val Val Asp Glu Ser
50



(2) INFORMATION FOR SEQ ID NO: 240: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 63 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240:

Gln Lys Leu Lys Arg Lys Ala Glu Glu Asp Pro Glu Ala Ala Asp Ser
1 5 10 15

Gly Glu Pro Gln Asn Lys Arg Thr Pro Asp Leu Pro Glu Glu Glu Tyr
20 25 30

Val Lys Glu Glu Ile Gln Glu Asn Glu Glu Ala Val Lys Lys Met Leu
35 40 45

Val Glu Ala Thr Arg Glu Phe Glu Glu Val Val Val Asp Glu Ser
50 55 60



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 113 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241:

Lys Ala Met Glu Lys Ser Ser Leu Thr Gln His Ser Trp Gln Ser Leu
1 5 10 15

Lys Asp Arg Tyr Leu Lys His Leu Arg Gly Gln Glu His Lys Tyr Leu
20 25 30

Leu Gly Asp Ala Pro Val Ser Pro Ser Ser Gln Lys Leu Lys Arg Lys
35 40 45

Ala Glu Glu Asp Pro Glu Ala Ala Asp Ser Gly Glu Pro Gln Asn Lys
50 55 60

Arg Thr Pro Asp Leu Pro Glu Glu Glu Tyr Val Lys Glu Glu Ile Gln
65 70 75 80

Glu Asn Glu Glu Ala Val Lys Lys Met Leu Val Glu Ala Thr Arg Glu
85 90 95

Phe Glu Glu Val Val Val Asp Glu Ser Pro Pro Asp Phe Glu Ile His
100 105 110

Ile



(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 148 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242:

Leu Pro Ser Tyr Asp Glu Ala Glu Arg Thr Lys Ala Glu Ala Thr Ile
1 5 10 15

Pro Leu Val Pro Gly Arg Asp Glu Asp Phe Val Gly Arg Asp Asp Phe
20 25 30

Asp Asp Ala Asp Gln Leu Arg Ile Gly Asn Asp Gly Ile Phe Met Leu
35 40 45

Thr Phe Phe Met Ala Phe Leu Phe Asn Trp Ile Gly Phe Phe Leu Ser
50 55 60

Phe Cys Leu Thr Thr Ser Ala Ala Gly Arg Tyr Gly Ala Ile Ser Gly
65 70 75 80

Phe Gly Leu Ser Leu Ile Lys Trp Ile Leu Ile Val Arg Phe Ser Thr
85 90 95

Tyr Phe Pro Gly Tyr Phe Asp Gly Gln Tyr Trp Leu Trp Trp Val Phe
100 105 110

Leu Val Leu Gly Phe Leu Leu Phe Leu Arg Gly Phe Ile Asn Tyr Ala
115 120 125

Lys Val Arg Lys Met Pro Glu Thr Phe Ser Asn Leu Pro Arg Thr Arg
130 135 140

Val Leu Phe Ile
145



(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243:

Ala Gly Arg Tyr Gly Ala Ile Ser Gly Phe Gly Leu Ser Leu Ile Lys
1 5 10 15

Trp Ile Leu Ile Val Arg Phe Ser
20



(2) INFORMATION FOR SEQ ID NO: 244:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:

Met Lys His Leu Ser Ala Trp Asn Phe Thr Lys Leu Thr Phe Leu Gln
1 5 10 15

Leu Trp Glu Ile Phe Glu Gly Ser Val Glu Asn Cys Gln Thr Leu Thr
20 25 30

Ser Tyr Ser Lys Leu Gln Ile Lys Tyr Thr Phe Ser Arg Gly Ser Thr
35 40 45

Phe Tyr Ile
50



(2) INFORMATION FOR SEQ ID NO: 245: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:

Phe Ser Ser Asp Phe Arg Thr Ser Pro Trp Glu Ser Arg Arg Val Glu
1 5 10 15

Ser Lys Ala Thr Ser Ala Arg Cys Gly Leu Trp Gly Ser Gly Pro Arg
20 25 30

Arg Arg Pro Ala Ser Gly Met Phe Arg Gly Leu Ser Ser Trp Leu Gly
35 40 45

Leu Gln Gln Pro Val Ala Gly Gly Gly Gln Pro Asn Gly Asp Ala Pro
50 55 60

Pro Glu Gln Pro Ser Glu Thr Val Ala Glu Ser Ala Glu Glu Glu Leu
65 70 75 80

Gln Gln Ala Gly Asp Gln Glu Leu Leu His Gln Ala Lys Asp Phe Gly
85 90 95

Asn Tyr Leu Phe Asn Phe Ala Ser Ala Ala Thr Lys Lys Ile Thr Glu
100 105 110

Ser Val Ala Glu Thr Ala Gln Thr Ile Lys Lys Ser Val Glu Glu Gly
115 120 125

Lys Ile Asp Gly Ile Ile Asp Lys Thr Ile Ile Gly Asp Phe Gln Lys
130 135 140

Glu Gln Lys Lys Phe Val Glu Glu Gln His Thr Lys Lys Ser Glu Ala
145 150 155 160

Ala Val Pro Pro Trp Val Asp Thr Asn Asp Glu Glu Thr Ile Gln Gln
165 170 175

Gln Ile Leu Ala Leu Ser Ala Asp Lys Arg Asn Phe Leu Arg Asp Pro
180 185 190

Pro Ala Gly Val Gln Phe Asn Phe Asp Phe Asp Gln Met Tyr Pro Val
195 200 205

Ala Leu Val Met Leu
210



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

Met Arg Phe Ala Leu Val Pro Lys Leu Val Lys Glu Glu Val Phe Trp
1 5 10 15

Arg Asn Tyr Phe Tyr Arg Val Ser Leu Ile Lys Gln Ser Ala Gln Leu
20 25 30

Thr Ala Leu Ala Ala Gln Gln Gln Ala Ala Gly Lys Gly Gly Glu Glu
35 40 45

Gln



(2) INFORMATION FOR SEQ ID NO: 247:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247:

Ser Thr Ser Pro Gly Val Ser Glu Phe Val Ser Asp Ala Phe Asp Ala
1 5 10 15

Cys Asn Leu Asn Gln Glu Asp Leu Arg Lys Glu Met Glu Gln Leu Val
20 25 30

Leu Asp Lys Lys Gln Glu Glu Thr Ala Val Leu Glu Glu Asp Ser Ala
35 40 45

Asp Trp Glu Lys Glu Leu Gln Gln Glu Leu Gln Glu Tyr Glu Val Val
50 55 60

Thr Glu Ser Glu Lys Arg Asp Glu Asn Trp Asp Lys
65 70 75



(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 62 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:

Ser Pro Trp Glu Ser Arg Arg Val Glu Ser Lys Ala Thr Ser Ala Arg
1 5 10 15

Cys Gly Leu Trp Gly Ser Gly Pro Arg Arg Arg Pro Ala Ser Gly Met
20 25 30

Phe Arg Gly Leu Ser Ser Trp Leu Gly Leu Gln Gln Pro Val Ala Gly
35 40 45

Gly Gly Gln Pro Asn Gly Asp Ala Pro Pro Glu Gln Pro Ser
50 55 60



(2) INFORMATION FOR SEQ ID NO: 249: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:

Pro Val Ala Gly Gly Gly Gln Pro Asn Gly Asp Ala Pro Pro Glu Gln
1 5 10 15

Pro Ser Glu Thr Val Ala Glu Ser Ala Glu Glu Glu Leu Gln Gln Ala
20 25 30

Gly Asp Gln Glu Leu Leu His Gln Ala Lys Asp Phe Gly Asn Tyr Leu
35 40 45

Phe Asn Phe Ala Ser Ala Ala Thr Lys Lys Ile Thr Glu Ser Val Ala
50 55 60

Glu
65



(2) INFORMATION FOR SEQ ID NO: 250: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 72 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:

Phe Gln Lys Glu Gln Lys Lys Phe Val Glu Glu Gln His Thr Lys Lys
1 5 10 15

Ser Glu Ala Ala Val Pro Pro Trp Val Asp Thr Asn Asp Glu Glu Thr
20 25 30

Ile Gln Gln Gln Ile Leu Ala Leu Ser Ala Asp Lys Arg Asn Phe Leu
35 40 45

Arg Asp Pro Pro Ala Gly Val Gln Phe Asn Phe Asp Phe Asp Gln Met
50 55 60

Tyr Pro Val Ala Leu Val Met Leu
65 70



(2) INFORMATION FOR SEQ ID NO: 251: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:

Pro Phe Ile Cys Val Ala Arg Asn Pro Val Ser Arg Asn Phe Ser Ser
1 5 10 15

Pro Ile Leu Ala Arg Lys Leu Cys Glu Gly Ala Ala
20 25



(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:

Lys Glu Asp Pro Ala Asn Thr Val Tyr Ser Thr Val Glu Ile Pro Lys
1 5 10 15

Lys Met Glu Asn Pro His Ser Leu Leu Thr Met Pro Asp Thr Pro Arg
20 25 30

Leu

(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 227 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:

Ala Ser Ala Val Leu Leu Asp Leu Pro Asn Ser Gly Gly Glu Ala Gln
1 5 10 15

Ala Lys Lys Leu Gly Asn Asn Cys Val Phe Ala Pro Ala Asp Val Thr
20 25 30

Ser Glu Lys Asp Val Gln Thr Ala Leu Ala Leu Ala Lys Gly Lys Phe
35 40 45

Gly Arg Val Asp Val Ala Val Asn Cys Ala Gly Ile Ala Val Ala Ser
50 55 60

Lys Thr Tyr Asn Leu Lys Lys Gly Gln Thr His Thr Leu Glu Asp Phe
65 70 75 80

Gln Arg Val Leu Asp Val Asn Leu Met Gly Thr Phe Asn Val Ile Arg
85 90 95

Leu Val Ala Gly Glu Met Gly Gln Asn Glu Pro Asp Gln Gly Gly Gln
100 105 110

Arg Gly Val Ile Ile Asn Thr Ala Ser Val Ala Ala Phe Glu Gly Gln
115 120 125

Val Gly Gln Ala Ala Tyr Ser Ala Ser Lys Gly Gly Ile Val Gly Met
130 135 140

Thr Leu Pro Ile Ala Arg Asp Leu Ala Pro Ile Gly Ile Arg Val Met
145 150 155 160

Thr Ile Ala Pro Gly Leu Phe Gly Thr Pro Leu Leu Thr Ser Leu Pro
165 170 175

Glu Lys Val Cys Asn Phe Leu Ala Ser Gln Val Pro Phe Pro Ser Arg
180 185 190

Leu Gly Asp Pro Ala Glu Tyr Ala His Leu Val Gln Ala Ile Ile Glu
195 200 205

Asn Pro Phe Leu Asn Gly Glu Val Ile Arg Leu Asp Gly Ala Ile Arg
210 215 220

Met Gln Pro
225

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:

Ser Val Ala Ala Phe Glu Gly Gln Val Gly Gln Ala Ala Tyr Ser Ala
1 5 10 15

Ser Lys Gly Gly Ile Val Gly Met Thr Leu Pro Ile Ala
20 25



(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 61 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255:

Ala Arg Arg Ser Gly Ala Glu Leu Ala Trp Asp Tyr Leu Cys Arg Trp
1 5 10 15

Ala Gln Lys His Lys Asn Trp Arg Phe Gln Lys Thr Arg Gln Thr Trp
20 25 30

Leu Leu Leu His Met Tyr Asp Ser Asp Lys Val Pro Asp Glu His Phe
35 40 45

Ser Thr Leu Leu Ala Tyr Leu Glu Gly Leu Gln Gly Arg
50 55 60



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:

His Pro Ile Glu Trp Ala Ile Asn Ala Ala Thr Leu Ser Gln Phe Tyr
1 5 10 15

Ile Asn Lys Leu Cys Phe
20



(2) INFORMATION FOR SEQ ID NO: 257:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257:

Cys Trp Ile Lys Tyr Cys Leu Thr Leu Met Gln Asn Ala Gln Leu Ser
1 5 10 15

Met Gln Asp Asn Ile Gly
20



(2) INFORMATION FOR SEQ ID NO: 258:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258:

Lys Val Ser Tyr Leu Arg Pro Leu Asp Phe Glu Glu Ala Arg Glu Leu
1 5 10 15

Phe Leu Leu Gly Gln His Tyr Val Phe
20 25



(2) INFORMATION FOR SEQ ID NO: 259: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:

Met Glu Arg Arg Cys Lys Met His Lys Arg Xaa Ile Ala Met Leu Glu
1 5 10 15

Pro Leu Thr Val Asp Leu Asn Pro Gln
20 25



(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:

Ser His Ile Val Lys Lys Ile Asn Asn Leu Asn Lys Ser Ala Leu Lys
1 5 10 15

Tyr Tyr Gln Leu Phe Leu Asp
20



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 64 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

Phe Thr His Leu Ser Thr Cys Leu Leu Ser Leu Leu Leu Val Arg Met
1 5 10 15

Ser Gly Phe Leu Leu Leu Ala Arg Ala Ser Pro Ser Ile Cys Ala Leu
20 25 30

Asp Ser Ser Cys Phe Val Gln Glu Tyr Cys Ser Ser Tyr Ser Ser Ser
35 40 45

Cys Phe Leu His Gln His Phe Pro Ser Leu Leu Asp His Leu Cys Gln
50 55 60



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

Phe Leu Leu Leu Ala Arg Ala Ser Pro Ser Ile Cys Ala Leu Asp Ser
1 5 10 15

Ser Cys Phe Val Gln Glu Tyr
20 



(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

Pro Asp Gly Arg Val Thr Asn Ile Pro Gln Gly Met Val Thr Asp Gln
1 5 10 15

Phe Gly Met Ile Gly Leu Leu Thr Phe Ile Arg Ala Ala Glu Thr Asp
20 25 30

Pro Gly Met Val His Leu Ala Leu Gly Ser Asp Leu Thr Thr Leu Gly 
35 40 45 

Leu Asn Leu Asn Ser 
50 



(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264:

Glu Asp Leu Leu Phe Tyr Leu Tyr Tyr Met Asn Gly Gly Asp Val Leu
1 5 10 15

Gln Leu Leu Ala Ala Val Glu Leu Phe Asn Arg Asp Trp Arg Tyr His
20 25 30

Lys Glu Glu Arg Val Trp Ile Thr Arg
35 40



(2) INFORMATION FOR SEQ ID NO: 265: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:

Val His Leu Ala Leu Gly Ser Asp Leu Thr Thr Leu Gly Leu Asn Leu
1 5 10 15

Asn Ser Pro Glu Asn Leu Tyr Pro
20



(2) INFORMATION FOR SEQ ID NO: 266: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266:

His Asn Glu Asp Phe Pro Ala Leu Pro Gly Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 75 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:

Gly Arg Ile Ile Asp Thr Ser Leu Thr Arg Asp Pro Leu Val Ile Glu
1 5 10 15

Leu Gly Gln Lys Gln Val Ile Pro Gly Leu Glu Gln Ser Leu Leu Asp
20 25 30

Met Cys Val Gly Glu Lys Arg Arg Ala Ile Ile Pro Ser His Leu Ala
35 40 45

Tyr Gly Lys Arg Gly Phe Pro Pro Ser Val Pro Ala Asp Ala Val Val
50 55 60

Gln Tyr Asp Val Glu Leu Ile Ala Leu Ile Arg
65 70 75



(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268:

Ile His Tyr Thr Gly Ser Leu Val Asp Gly Arg Ile Ile Asp Thr Ser
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 269:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269:

Cys Glu Ser Pro Glu Ser Pro Ala Gln Pro Ser Gly Ser Ser Leu Pro
1 5 10 15

Ala Trp Tyr His 
20 



(2) INFORMATION FOR SEQ ID NO: 270: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 95 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270:

Glu Glu Ala Gly Ala Gly Arg Arg Cys Ser His Gly Gly Ala Arg Pro
1 5 10 15

Ala Gly Leu Gly Asn Glu Gly Leu Gly Leu Gly Gly Asp Pro Asp His
20 25 30

Thr Asp Thr Gly Ser Arg Ser Lys Gln Arg Ile Asn Asn Trp Lys Glu
35 40 45

Ser Lys His Lys Val Ile Met Ala Ser Ala Ser Ala Arg Gly Asn Gln
50 55 60

Asp Lys Asp Ala His Phe Pro Pro Pro Ser Lys Gln Ser Leu Leu Phe
65 70 75 80

Cys Pro Lys Ser Lys Leu His Ile His Arg Ala Glu Ile Ser Lys
85 90 95



(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

Ser Lys Gln Arg Ile Asn Asn Trp Lys Glu Ser Lys His Lys Val Ile
1 5 10 15

Met Ala Ser Ala Ser Ala Arg 
20 



(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272:

Leu Phe His Trp Ala Cys Leu Asn Glu Arg Ala Ala Gln Leu Pro Arg
1 5 10 15

Asn Thr Ala Xaa Ala Gly Tyr Gln Cys Pro Ser Cys Asn Gly Pro Ser
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 

Phe Tyr Ile Tyr Tyr Arg Pro Thr Asp Ser Asp Asn Asp Ser Asp Tyr
1 5 10 15

Lys Lys Asp Met Val Glu Gly Asp Lys Tyr Trp His Ser Ile Ser His
20 25 30

Leu Gln Pro Glu Thr Ser Tyr Asp Ile Lys Met Gln Cys Phe Asn Glu
35 40 45

Gly Gly Glu Ser Glu Phe Ser Asn Val Met Ile Cys Glu Thr Lys Ala
50 55 60

Arg Lys Ser Ser Gly Gln Pro Gly Arg Leu Pro Pro Pro Thr Leu Ala
65 70 75 80

Pro Pro Gln Pro Pro Leu Pro Glu Thr Ile Glu Arg Pro Val Gly Thr
85 90 95

Gly Ala Met Val Ala Arg Ser Ser Asp Leu Pro Tyr Leu Ile Val Gly
100 105 110

Val Val Leu Gly Ser Ile Val Leu Ile Ile Val Thr Phe Ile Pro Phe
115 120 125

Cys Leu Trp Arg Ala Trp Ser Lys Gln Lys His Thr Thr Asp Leu Gly
130 135 140

Phe Pro Arg Ser Ala Leu Pro Pro Ser Cys Pro Tyr Thr Met Val Pro
145 150 155 160

Leu Gly Gly Leu Pro Gly His Gln Ala Val Asp Ser Pro Thr Ser Val
165 170 175

Ala Ser Val Asp Gly Pro Val Leu Met
180 185

(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274:

Tyr Ile Tyr Tyr Arg Pro Thr Asp Ser Asp Asn Asp Ser Asp Tyr Lys
1 5 10 15

Lys Asp Met Val Glu Gly Asp Lys Tyr Trp His Ser Ile Ser His Leu
20 25 30

Gln Pro Glu Thr Ser Tyr Asp Ile Lys Met Gln Cys Phe Asn Glu Gly
35 40 45

Gly Glu Ser Glu Phe Ser Asn Val Met Ile Cys Glu Thr Lys Ala Arg
50 55 60

Lys Ser
65

(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:

Asn Val Arg Ala Leu Leu His Arg Met Pro Glu Pro Pro Lys Ile Asn
1 5 10 15



WO 98/42738 



PCT/US98/05311 



352 



Thr Ala Lys Phe Asn Asn Asn Lys Arg Lys Asn Leu Ser Leu 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 185 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276:

Asn Thr Asn Gln Arg Glu Ala Leu Gln Tyr Ala Lys Asn Phe Gln Pro
1 5 10 15

Phe Ala Leu Asn His Gln Lys Asp Ile Gln Val Leu Met Gly Ser Leu
20 25 30

Val Tyr Leu Arg Gln Gly Ile Glu Asn Ser Pro Tyr Val His Leu Leu
35 40 45

Asp Ala Asn Gln Trp Ala Asp Ile Cys Asp Ile Phe Thr Arg Asp Ala
50 55 60

Cys Ala Leu Leu Gly Leu Ser Val Glu Ser Pro Leu Ser Val Ser Phe
65 70 75 80

Ser Ala Gly Cys Val Ala Leu Pro Ala Leu Ile Asn Ile Lys Ala Val
85 90 95

Ile Glu Gln Arg Gln Cys Thr Gly Val Trp Asn Gln Lys Asp Glu Leu
100 105 110

Pro Ile Glu Val Asp Leu Gly Lys Lys Cys Trp Tyr His Ser Ile Phe
115 120 125

Ala Cys Pro Ile Leu Arg Gln Gln Thr Thr Asp Asn Asn Pro Pro Met
130 135 140

Lys Leu Val Cys Gly His Ile Ile Ser Arg Asp Ala Leu Asn Lys Met
145 150 155 160

Phe Asn Gly Ser Lys Leu Lys Cys Pro Tyr Cys Pro Met Glu Gln Ser
165 170 175

Pro Gly Asp Ala Lys Gln Ile Phe Phe
180 185



(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 65 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277:

Ser Tyr Leu Ser Ala Cys Phe Ala Gly Cys Asn Ser Thr Asn Leu Thr
1 5 10 15

Gly Cys Ala Cys Leu Thr Thr Val Pro Ala Glu Asn Ala Thr Val Val
20 25 30

Pro Gly Lys Cys Pro Ser Pro Gly Cys Gln Glu Ala Phe Leu Thr Phe
35 40 45

Leu Cys Val Met Cys Ile Cys Ser Leu Ile Gly Ala Met Ala Arg His
50 55 60

Pro
65



(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 84 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:

Pro Ser Val Ile Ile Leu Ile Arg Thr Val Ser Pro Glu Leu Lys Ser
1 5 10 15

Tyr Ala Leu Gly Val Leu Phe Leu Leu Leu Arg Leu Leu Gly Phe Ile
20 25 30

Pro Pro Pro Leu Ile Phe Gly Ala Gly Ile Asp Ser Thr Cys Leu Phe
35 40 45

Trp Ser Thr Phe Cys Gly Glu Gln Gly Ala Cys Val Leu Tyr Asp Asn
50 55 60

Val Val Tyr Arg Tyr Leu Tyr Val Ser Ile Ala Ile Ala Leu Lys Ser
65 70 75 80

Phe Ala Phe Ile

(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:

Gln Ser Leu Phe Thr Arg Phe Val Arg Val Gly Val Pro Thr Val Asp
1 5 10 15

Leu Asp Ala Gln Gly Arg Ala Arg Ala Ser Leu Cys Xaa Xaa Tyr Asn
20 25 30

Trp Arg Tyr Lys Asn Leu Gly Asn Leu Pro His Val Gln Leu Leu Pro
35 40 45

Glu Phe Ser Thr Ala Asn Ala Gly Leu Leu Tyr Asp Phe Gln Leu Ile
50 55 60

Asn Val Glu Asp Phe Gln Gly Val Gly Glu Ser Glu Pro Asn Pro Tyr
65 70 75 80

Phe Tyr Gln Asn Leu Gly Glu Ala Glu Tyr Val Val Ala Leu Phe Met
85 90 95

Tyr Met Cys Leu Leu Gly Tyr Pro Ala Asp Lys Ile Ser Ile Leu Thr
100 105 110

Thr Tyr Asn Gly Gln Lys His Leu Ile Arg Asp Ile Ile Asn Arg Arg
115 120 125

Cys Gly Asn Asn Pro Leu Ile Gly Arg Pro Asn Lys Val Thr Thr Val
130 135 140

Asp Arg Phe Gln Gly Gln Gln Asn Asp Tyr Ile Leu Leu Ser Leu Val
145 150 155 160

Arg Thr Arg Ala Val Gly His Leu Arg Asp Val Arg Arg Leu Val Val
165 170 175

Ala Met Ser Arg Ala Arg
180

(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 77 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:

Leu Val Lys Glu Ala Lys Ile Ile Ala Met Thr Cys Thr His Ala Ala
1 5 10 15

Leu Lys Arg His Asp Leu Val Lys Leu Gly Phe Lys Tyr Asp Asn Ile
20 25 30

Leu Met Glu Glu Ala Ala Gln Ile Leu Glu Ile Glu Thr Phe Ile Pro
35 40 45

Leu Leu Leu Gln Asn Pro Gln Asp Gly Phe Ser Arg Leu Lys Arg Trp
50 55 60

Ile Met Ile Gly Asp His His Gln Leu Pro Pro Val Ile
65 70 75

(2) INFORMATION FOR SEQ ID NO: 281:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 125 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:

Asp Thr Tyr Pro Asn Glu Glu Lys Gln Gln Glu Arg Val Phe Pro Xaa
1 5 10 15

Xaa Ser Ala Met Val Asn Asn Gly Ser Leu Ser Tyr Asp His Glu Arg
20 25 30

Asp Gly Arg Pro Thr Glu Leu Gly Gly Cys Xaa Ala Ile Val Arg Asn
35 40 45

Leu His Tyr Asp Thr Phe Leu Val Ile Arg Tyr Val Lys Arg His Leu
50 55 60

Thr Ile Met Met Asp Ile Asp Gly Lys His Glu Trp Arg Asp Cys Ile
65 70 75 80

Glu Val Pro Gly Val Arg Leu Pro Arg Gly Tyr Tyr Phe Gly Thr Ser
85 90 95

Ser Ile Thr Gly Asp Leu Ser Asp Asn His Asp Val Ile Ser Leu Lys
100 105 110

Leu Phe Glu Leu Thr Val Glu Arg Thr Pro Glu Glu Glu
115 120 125

(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282:

Leu Lys Arg Glu His Ser Leu Ser Lys Pro Tyr Gln Gly Val Gly Thr
1 5 10 15

Gly Ser Ser Ser Leu Trp Asn Leu Met Gly Asn Ala Met Val Met Thr
20 25 30

Gln Tyr Ile Arg Leu Thr Pro Asp Met Gln Ser Lys Gln Gly Ala Leu
35 40 45

Trp Asn Arg Val Pro Cys Phe Leu Arg Asp Trp Glu Leu Gln Val His
50 55 60

Phe Lys Ile His Gly Gln Gly Lys Lys Asn Leu His Gly Asp Gly Leu
65 70 75 80

Ala Ile Trp Tyr Thr
85



(2) INFORMATION FOR SEQ ID NO: 283:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283:

Pro Gly Thr Leu Gln Cys Ser Ala Leu His His Asp Pro Gly Cys Ala
1 5 10 15

Asn Cys Ser Arg Phe Cys Arg Asp Cys Ser Pro Pro Ala Cys Gln Cys
20 25 30



(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

Phe Leu Tyr Asp Val Leu Met Xaa His Glu Ala Val Met Arg Thr His
1 5 10 15

Gln Ile Gln Leu Pro Asp Pro Glu Phe Pro Ser
20 25



(2) INFORMATION FOR SEQ ID NO: 285:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285:



Gly Trp Tyr Trp Cys Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 286: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 129 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286:

Met Lys Val Gly Ala Arg Ile Arg Val Lys Met Ser Val Asn Lys Ala
1 5 10 15

His Pro Val Val Ser Thr His Trp Arg Trp Pro Ala Glu Trp Pro Gln
20 25 30

Met Phe Leu His Leu Ala Gln Glu Pro Arg Thr Glu Val Lys Ser Arg
35 40 45

Pro Leu Gly Leu Ala Gly Phe Ile Arg Gln Asp Ser Lys Thr Arg Lys
50 55 60

Pro Leu Glu Gln Glu Thr Ile Met Ser Ala Ala Asp Thr Ala Leu Trp
65 70 75 80

Pro Tyr Gly His Gly Asn Arg Glu His Gln Glu Asn Glu Leu Gln Lys
85 90 95

Tyr Leu Gln Tyr Lys Asp Met His Leu Leu Asp Ser Gly Gln Ser Leu
100 105 110

Gly His Thr His Thr Leu Gln Gly Ser His Asn Leu Thr Ala Leu Asn
115 120 125

Ile



(2) INFORMATION FOR SEQ ID NO: 287: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:

Ser Leu His Lys Asn Ser Val Ser Gln Ile Ser Val Leu Ser Gly Gly
1 5 10 15

Lys Ala Lys Cys Ser Gln Phe Cys Thr Thr Gly Met Asp Gly Gly Met
20 25 30

Ser Ile Trp Asp Val Lys Ser Leu Glu Ser Ala Leu Lys Asp Leu Lys
35 40 45

Ile

(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:

Glu Ala Ser Lys Ser Ser His Ala Gly Leu Asp Leu Phe Ser Val Ala
1 5 10 15

Ala Cys His Arg Phe
20



(2) INFORMATION FOR SEQ ID NO: 289:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289:

Tyr Met Gly Lys Gly Ser Met Thr Gly Leu Ala Leu Lys His Met Phe
1 5 10 15

Glu Arg Ser Phe Thr
20



(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:

Val Thr Gly Ile Ile Asp Ser Leu Thr Ile Ser Pro Lys Ala Ala Arg
1 5 10 15

Val Gly Leu Leu Gln Tyr Ser Thr Gln Val His
20 25



(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

Thr Glu Phe Thr Leu Arg Asn Phe Asn Ser Ala Lys Asp Met Lys Lys 
1 5 10 15

Ala Val Ala His Met Lys Tyr Met
20



(2) INFORMATION FOR SEQ ID NO: 292:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:

Gly Lys Gly Ser Met Thr Gly Leu Ala Leu Lys His Met Phe Glu Arg
1 5 10 15

Ser Phe Thr Gln Gly Glu Gly Ala Arg Pro Phe
20 25 



(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293:

Ser Thr Arg Val Pro Arg Ala Ala Ile Val Phe Thr Asp Gly Arg Ala
1 5 10 15

Gln Asp Asp Val Ser Glu Trp Ala Ser Lys Ala Lys Ala Asn Gly Ile
20 25 30

Thr Met Tyr Ala Val Gly Val Gly Lys Ala Ile Glu
35 40 



(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294:

Glu Glu Leu Gln Glu Ile Ala Ser Glu Pro Thr Asn Lys His Leu Phe
1 5 10 15

Tyr Ala Glu Asp Phe Ser Thr Met Asp Glu Ile Ser Glu Lys Leu Lys
20 25 30

Lys Gly Ile Cys Glu Ala Leu Glu Asp Ser
35 40



(2) INFORMATION FOR SEQ ID NO: 295: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295:

Thr Gln Arg Leu Glu Glu Met Thr Gln Arg Met
1 5 10



(2) INFORMATION FOR SEQ ID NO: 296:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:

Pro Gln Gly Cys Pro Glu Gln Pro Leu His
1 5 10



(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:

Arg Cys Lys Lys Cys Thr Glu Gly Pro Ile Asp Leu Val Phe Val Ile
1 5 10 15

Asp Gly Ser Lys Ser Leu Gly Glu Glu Asn Phe Glu Val Val Lys Gln
20 25 30

Phe



(2) INFORMATION FOR SEQ ID NO: 298: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:

Met Ala Ala Leu Leu Leu Arg His Val Gly Arg His Cys Leu Arg Ala
1 5 10 15

His Phe Ser Pro Gln Leu Cys Ile Arg Asn Ala Val Pro Leu Gly Thr
20 25 30

Thr Ala Lys Glu Glu Met Glu Arg Phe Trp Asn Lys Asn Ile Gly Ser
35 40 45

Asn Arg Pro Leu Ser Pro His Ile Thr Ile Tyr Ser
50 55 60

(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:

Val Phe Pro Leu Met Tyr His Thr Trp Asn Gly Ile Arg His Leu Met
1 5 10 15

Trp Asp Leu Gly Lys Gly Leu Lys Ile Pro Gln Leu Tyr Gln Ser Gly
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

Met Ala Ala Leu Leu Leu Arg His Val Gly Arg His Cys Leu Arg Ala 
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:

Val Lys Ser Leu Cys Leu Gly Pro Ala Leu Ile His Thr Ala Lys Phe
1 5 10 15

Ala Leu 



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

Val Phe Pro Leu Met Tyr His Thr Trp Asn Gly Ile Arg His Leu Met
1 5 10 15

Trp Asp Leu Gly Lys Gly Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:

Arg Val Trp Asp Val Arg Pro Phe Ala Pro Lys Glu Arg Cys Val Lys
1 5 10 15

Ile Phe Gln Gly Asn Val
20



(2) INFORMATION FOR SEQ ID NO: 304: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:

His Asn Phe Glu Lys Asn Leu Leu Arg Cys Ser Trp Ser Pro Asp Gly
1 5 10 15

Ser Lys Ile Ala Ala Gly Ser Ala Asp Arg Phe Val Tyr Val
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 305:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305:

Trp Asp Thr Thr Ser Arg Arg Ile Leu Tyr Lys Leu Pro Gly His Ala
1 5 10 15

Gly Ser Ile Asn Glu Val Ala Phe His Pro Asp Glu Pro Ile
20 25 30

(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:

Val Arg Gly Arg Thr Val Leu Arg Pro Gly Leu Asp Ala Glu Pro Glu
1 5 10 15

Leu Ser Pro Glu
20



(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:

Glu Gln Arg Val Leu Glu Arg Lys Leu Lys Lys Glu Arg Lys Lys Glu
1 5 10 15

Glu Arg Gln



(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:

Arg Leu Arg Glu Ala Gly Leu Val Ala Gln His Pro Pro
1 5 10

(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:

Gly Arg Ile Pro Ala Pro Ala Pro Ser Val Pro Ala Gly Pro Asp Ser
1 5 10 15

Arg



(2) INFORMATION FOR SEQ ID NO: 310: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:

Thr Gly Cys Val Leu Val Leu Ser Arg Asn Phe Val Gln Tyr Ala Cys
1 5 10 15

Phe Gly Leu Phe Gly Ile Ile Ala Leu Gln Thr Ile Ala Tyr Ser Ile
20 25 30

Leu Trp Asp Leu Lys Phe Leu Met Arg Asn
35 40

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:

Ser Arg Ser Glu Gly Lys Ser Met Phe Ala Gly Val Pro Thr Met Arg
1 5 10 15

Glu Ser Ser Pro Lys Gln Tyr Met Gln Leu Gly Gly Arg Val Leu Leu
20 25 30

Val Leu Met Phe Met Thr Leu Leu His Phe Asp Ala Ser Phe Phe Ser
35 40 45

Ile Val Gln Asn Ile Val Gly
50 55



(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 60 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312:

Gly Thr Ala Glu Asp Phe Ala Asp Gln Phe Leu Arg Val Thr Lys Gln
1 5 10 15

Tyr Leu Pro His Val Ala Arg Leu Cys Leu Ile Ser Thr Phe Leu Glu
20 25 30

Asp Gly Ile Arg Met Trp Phe Gln Trp Ser Glu Gln Arg Asp Tyr Ile
35 40 45

Asp Thr Thr Trp Asn Cys Gly Tyr Leu Leu Ala Ser
50 55 60



(2) INFORMATION FOR SEQ ID NO: 313:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313:

Ala Ser Phe Leu Leu Ser Arg Thr Ser Trp Gly Thr Ala Leu Met Ile
1 5 10 15

Leu



(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

Leu Met Arg Asn Glu Ser Arg Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

Ala Ser Phe Leu Leu Ser Arg Thr Ser Trp Gly Thr Ala 
1 5 10



(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 

Phe Ile Ser Phe Ala Asn Ser Arg Ser Ser Glu Asp Thr Lys Gln Met
1 5 10 15

Met Ser Ser Phe 
20 



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

Asp Pro Arg Arg Pro Asn Lys Val Leu Arg Tyr Lys Pro Pro Pro Ser 
1 5 10 15

Glu Cys Asn Pro Ala Leu Asp Asp Pro Thr Pro 
20 25 






(2) INFORMATION FOR SEQ ID NO: 318:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318:

Asp Tyr Met Asn Leu Leu Gly Met Ile Phe Ser Met Cys Gly Leu Met
1 5 10 15

Leu Lys Leu Lys Trp Cys Ala Trp Val Ala Val Tyr Cys Ser
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 319:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319:

Met Leu Ser Ile Ser Ala Val Val Met Ser Tyr Leu Gln Asn Pro Gln
1 5 10 15

Pro Met Thr Pro Pro Trp
20

(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320:

Ala Ala Gly Asp Gly Asp Val Lys Leu Gly Thr Leu Gly Ser Gly Ser
1 5 10 15

Glu Ser Ser Asn Asp Gly Gly Ser Glu Ser Pro Gly Asp Ala Gly Ala
20 25 30

Ala Ala Xaa Gly Gly Gly Trp Ala Ala Ala Ala Leu Ala Leu Leu Thr
35 40 45

Gly Gly Gly Glu
50



(2) INFORMATION FOR SEQ ID NO: 321:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 177 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321:

Ala Ala Asp Asn Tyr Gly Ile Pro Arg Ala Cys Arg Asn Ser Ala Arg
1 5 10 15

Ser Tyr Gly Ala Ala Trp Leu Leu Leu Xaa Pro Ala Gly Ser Ser Arg
20 25 30

Val Glu Pro Thr Gln Asp Ile Ser Ile Ser Asp Gln Leu Gly Gly Gln
35 40 45

Asp Val Pro Val Phe Arg Asn Leu Ser Leu Leu Val Val Gly Val Gly
50 55 60

Ala Val Phe Ser Leu Leu Phe His Leu Gly Thr Arg Glu Arg Arg Arg
65 70 75 80

Pro His Ala Xaa Glu Pro Gly Glu His Thr Pro Leu Leu Ala Pro Ala
85 90 95

Thr Ala Gln Pro Leu Leu Leu Trp Lys His Trp Leu Arg Glu Xaa Ala
100 105 110

Phe Tyr Gln Val Gly Ile Leu Tyr Met Thr Thr Arg Leu Ile Val Asn
115 120 125

Leu Ser Gln Thr Tyr Met Ala Met Tyr Leu Thr Tyr Ser Leu His Leu
130 135 140

Pro Lys Lys Phe Ile Ala Thr Ile Pro Leu Val Met Tyr Leu Ser Gly
145 150 155 160

Phe Leu Ser Ser Phe Leu Met Lys Pro Ile Asn Lys Cys Ile Gly Arg
165 170 175

Asn



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 243 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322:

Arg Ile Thr Asp Asn Pro Glu Gly Lys Trp Leu Gly Arg Thr Ala Arg
1 5 10 15

Gly Ser Tyr Gly Tyr Ile Lys Thr Thr Ala Val Glu Ile Xaa Tyr Asp
20 25 30

Ser Leu Lys Leu Lys Lys Asp Ser Leu Gly Ala Pro Ser Arg Pro Ile
35 40 45

Glu Asp Asp Gln Glu Val Tyr Asp Asp Val Ala Glu Gln Asp Asp Ile
50 55 60

Ser Ser His Ser Gln Ser Gly Ser Gly Gly Ile Phe Pro Pro Pro Pro
65 70 75 80

Asp Asp Asp Ile Tyr Asp Gly Ile Glu Glu Glu Asp Ala Asp Asp Gly
85 90 95

Phe Pro Ala Pro Pro Lys Gln Leu Asp Met Gly Asp Glu Val Tyr Asp
100 105 110

Asp Val Asp Thr Ser Asp Phe Pro Val Ser Ser Ala Glu Met Ser Gln
115 120 125

Gly Thr Asn Val Gly Lys Ala Lys Thr Glu Glu Lys Asp Leu Lys Lys
130 135 140

Leu Lys Lys Gln Xaa Lys Glu Xaa Lys Asp Phe Arg Lys Lys Phe Lys
145 150 155 160

Tyr Asp Gly Glu Ile Arg Val Leu Tyr Ser Thr Lys Val Thr Thr Ser
165 170 175

Ile Thr Ser Lys Lys Trp Gly Thr Arg Asp Leu Gln Val Lys Pro Gly
180 185 190

Glu Ser Leu Glu Val Ile Gln Thr Thr Asp Asp Thr Lys Val Leu Cys
195 200 205

Arg Asn Glu Glu Gly Lys Tyr Gly Tyr Val Leu Arg Ser Tyr Leu Ala
210 215 220

Asp Asn Asp Gly Glu Ile Tyr Asp Asp Ile Ala Asp Gly Cys Ile Tyr
225 230 235 240

Asp Asn Asp



(2) INFORMATION FOR SEQ ID NO: 323: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:

Ser Met Ser Ala Leu Thr Arg Leu Ala Ser Phe Ala Arg Val Gly Gly
1 5 10 15

Arg Leu Phe Arg Ser Gly Cys Ala Arg Thr Ala Gly Asp Gly Gly Val
20 25 30

Arg His Ala Gly Gly Gly Val His Ile Glu Pro Arg Tyr Arg Gln Phe
35 40 45

Pro Gln Leu Thr Arg Ser Gln Val Phe Gln Ser Glu Phe Phe Ser Gly
50 55 60

Leu Met Trp Phe Trp Ile Leu Trp Arg Phe Trp His Asp Ser Glu Glu
65 70 75 80

Val Leu Gly His Phe Pro Tyr Pro Asp Pro Ser Gln Trp Thr Asp Glu
85 90 95

Glu Leu Gly Ile Pro Pro Asp Asp Glu Asp
100 105



Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

10801 University Boulevard
Manassas, Virginia 20110-2209
United States of America

Date of deposit February 25, 1998

Accession Number 209641

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer



Applicant's or agent's file
reference number



Z004PCT

International application No. Unassigned

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 75, line N/A

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

10801 University Boulevard
Manassas, Virginia 20110-2209
United States of America

Date of deposit July 24, 1997

Accession Number 209179

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer



Applicant's or agent's file
reference number



Z004PCT 



International application No. Unassigned

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 77, line N/A

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

10801 University Boulevard
Manassas, Virginia 20110-2209
United States of America

Date of deposit March 7, 1997

Accession Number 97924

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer



Applicant's or agent's file
reference number

Z004PCT

International application No. Unassigned





INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 80 ■ line N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit March 13. 1997 



Accession Number 97958 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (If the indications are not for all designated Stat*) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {spectfythe general nature of the indications, e.g. Accession 
Number of Deposit") 



For receiving Office use only . 



This sheet was received with the international application



Authorized officer 



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 



WO 98/42738 
Applicant's or agent's file reference number: Z004PCT

International application No.: Unassigned



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 80, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

10801 University Boulevard 
Manassas, Virginia 20110-2209
United States of America 



Date of deposit May 22, 1997



Accession Number 209072 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer 






For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 






PCT/US98/05311 
Applicant's or agent's file reference number: Z004PCT

International application No.: Unassigned



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 80, line N/A



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209
United States of America 



Date of deposit September 4, 1997 



Accession Number 209235 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Applicant's or agent's file reference number: Z004PCT

International application No.: Unassigned



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 84, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

10801 University Boulevard 
Manassas, Virginia 20110-2209
United States of America 



Date of deposit August 28, 1997



Accession Number 209226 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer 



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Applicant's or agent's file reference number: Z004PCT

International application No.: Unassigned



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 84, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209
United States of America 



Date of deposit March 13, 1997



Accession Number 97957 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer 



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 



Applicant's or agent's file reference number: Z004PCT

International application No.: Unassigned



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 84, line N/A



B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country)

10801 University Boulevard 
Manassas, Virginia 20110-2209
United States of America 



Date of deposit May 22, 1997


Accession Number 209073 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer 



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



What Is Claimed Is:

1. An isolated nucleic acid molecule comprising a polynucleotide having a
nucleotide sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment of
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID
NO:X;

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO: Y or a
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit No:Z,
which is hybridizable to SEQ ID NO:X;

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO: Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO: Y or a
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit No:Z,
which is hybridizable to SEQ ID NO:X;

(e) a polynucleotide encoding a polypeptide of SEQ ID NO: Y or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X, 
having biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO:X;

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homologue of the SEQ ID NO: Y; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any
one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide
sequence of only A residues or of only T residues.

2. The isolated nucleic acid molecule of claim 1, wherein the
polynucleotide fragment comprises a nucleotide sequence encoding a secreted protein.

3. The isolated nucleic acid molecule of claim 1, wherein the
polynucleotide fragment comprises a nucleotide sequence encoding the sequence 
identified as SEQ ID NO:Y or the polypeptide encoded by the cDNA sequence included 
in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 
4. The isolated nucleic acid molecule of claim 1, wherein the
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X or
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID
NO:X.

5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide
sequence comprises sequential nucleotide deletions from either the C-terminus or the
N-terminus.

6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide
sequence comprises sequential nucleotide deletions from either the C-terminus or the
N-terminus.

7. A recombinant vector comprising the isolated nucleic acid molecule of
claim 1.

8. A method of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 1.

9. A recombinant host cell produced by the method of claim 8.

10. The recombinant host cell of claim 9 comprising vector sequences.

11. An isolated polypeptide comprising an amino acid sequence at least 95%
identical to a sequence selected from the group consisting of:

(a) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence included 
in ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence included 
in ATCC Deposit No:Z, having biological activity; 

(c) a polypeptide domain of SEQ ID NO: Y or the encoded sequence included in
ATCC Deposit No:Z;

(d) a polypeptide epitope of SEQ ID NO: Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z;

(f) a full length protein of SEQ ID NO: Y or the encoded sequence included in 
ATCC Deposit No:Z; 
(g) a variant of SEQ ID NO: Y;

(h) an allelic variant of SEQ ID NO: Y; or 

(i) a species homologue of the SEQ ID NO: Y. 

12. The isolated polypeptide of claim 11, wherein the secreted form or the
full length protein comprises sequential amino acid deletions from either the C-terminus
or the N-terminus.

13. An isolated antibody that binds specifically to the isolated polypeptide of 
claim 11. 

14. A recombinant host cell that expresses the isolated polypeptide of claim 11.



15. A method of making an isolated polypeptide comprising:

(a) culturing the recombinant host cell of claim 14 under conditions such that
said polypeptide is expressed; and

(b) recovering said polypeptide.

16. The polypeptide produced by claim 15. 


17. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a mammalian subject a therapeutically effective amount of 
the polypeptide of claim 11 or the polynucleotide of claim 1.

18. A method of diagnosing a pathological condition or a susceptibility to a
pathological condition in a subject comprising:

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological
condition based on the presence or absence of said mutation.

19. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of
claim 11 in a biological sample; and

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 
20. A method for identifying a binding partner to the polypeptide of claim 11
comprising: 

(a) contacting the polypeptide of claim 11 with a binding partner; and

(b) determining whether the binding partner effects an activity of the 
polypeptide. 

21. The gene corresponding to the cDNA sequence of SEQ ID NO: Y.

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 

23. The product produced by the method of claim 22. 



